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China's wind-power potential 


The nation can lead the world in wind energy — but its policies need to be more coherent. 


all the right things are happening. The country has doubled 

its capacity every year for the past three years; in 2007, it had 
surpassed a 5-gigawatt target three years ahead of schedule and, in 
2008, it hit a revised 10-gigawatt target two years early. Domestic 
manufacturers are positioned to produce more wind turbines than 
any other country over the next three years. 

But the reality is much more complicated. Many wind projects do 
not even get off the ground because power companies cannot make 
enough money from them. Those that do go forward often produce 
turbines that simply sit, waiting for four months or more — a big 
delay in financial terms — to be hooked up to the electricity grid. 
And even when they are connected, they break down more often and 
are much less efficient at producing energy than those in many other 
countries (see page 372). 

In May 2007, the Global Wind Energy Council, the Chinese 
Renewable Energy Industries Association, and the China Wind 
Energy Association released a joint memorandum proposing 
improvements in the industry. More than a year and a half later, the 
concerns expressed in the memorandum have been borne out and 
the recommendations remain just as pertinent. 

For example, the bidding system used by the government to 
appoint developers favours companies that agree to supply electric- 
ity at cheaper prices — even if that price will render them unprofit- 
able. As a result, many projects haven't even got started. International 
development companies with more experience and foreign turbine- 
makers with more efficient machines don't even bother to bid. The 
memorandum recommends setting ‘feed-in tariffs, which offer a 
guaranteed rate for power supplied and offer developers more con- 
sistency and planning. China has moved in this direction, but the 
process of allocating the projects is still opaque and, from the per- 
spective of developers and turbine-makers, frustrating. 

The country also needs to significantly improve its grid, and to 
coordinate it with renewable-energy developments. Grid companies 


\ quick glance at China’s wind-energy statistics suggests that 


are understandably not keen to embrace energy produced by wind — 
an erratic and relatively expensive source — so it will take incentives 
to make the grid companies want to play ball. 

Another way that China has discouraged foreign developers from 
entering the fray is by preventing companies with less than 51% 
Chinese ownership from taking advantage of the Clean Development 
Mechanism, which allows developed countries to offset their carbon- 
reduction commitments under the Kyoto Protocol by investing in 
sustainable-energy projects in developing countries. If one has to be 
protectionist, surely it is better to do so ina way that bolsters domestic 
companies rather than simply penalizing “Chinaneeds 
foreign ones? Although some of China's eae 

to significantly 


policies have used this approach — it | : ; 
requires, for instance, that 70% ofturbine Improve its grid, and 
to coordinate it with 


parts be produced locally, encouraging 
sae aa ease to build caer renewable-energy 

ing facilities in China — it would do we " 
aoe this across the board. developments. 

China could still learn a lot about turbine manufacturing and wind- 
farm maintenance and management from countries that have much 
more experience in wind energy. One place to start would be to end 
its obsession with the number and capacity of its turbines and focus 
instead on producing power from them. 

The speed with which the nation has scaled up to 10 gigawatts of 
wind energy is impressive. But if China could harness the 3,000 or so 
gigawatts of wind estimated to be available in the country, it would be 
able to cover almost all of its current electricity demand. That figure 
wont be achieved any time soon. But China should make good on plans 
to hit a more reasonable target — upping its 2020 projection from 30 
gigawatts to 100 gigawatts. Operating at international standards of effi- 
ciency could produce 5% of the nation’s energy needs and, depending 
on US policy over the next few years, make China the biggest producer 
of wind energy in the world. But only if China takes a more aggressive 
and rational approach will it make the most of its wind. a 


Dismal no more 


Europe's Joint Research Centre should be 
empowered to stimulate other EU institutions. 


parliament, with the European Commission as its execu- 
tive body. But there is another component that is generally 
unloved and yet has a crucial role: the Joint Research Centre (JRC). 
The seven large institutes that make up the JRC have remits that range 
from energy and environment to health and security. With a budget 
that approaches €400 million (US$520 million) per year, the JRC is 


Te European Union is run by its council of ministers and its 


responsible for providing the scientific and technical support for EU 
policy. And according to a fairly positive evaluation released last week, 
led by David King, former science adviser to the British government, 
it now does this quite well. 

The dismal reputation of the JRC originated in 1980s, after changing 
priorities in Europe had forced the centre to shift from its original man- 
date of researching nuclear energy, safety and security. It reinvented 
itself, and diversified its research fields, in a largely undirected and 
unmonitored way. Politicians soon started to complain about its lack 
ofa clear mission and its inefficiency. In 1998, the institutes’ labs were 
relaunched with a new, customer-orientated mission, tightly harness- 
ing them to EU policy support and fulfilment. 

The various JRC centres are now reasonably efficient machines, 
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pretty well stripped of the dead wood that had been dragging them 
down. They fulfil just about every task their customers — that is, the EU 
bodies — set them, often with great success. The King report acknowl- 
edges, for example, that the biotechnology division has become the 
world leader in setting standards for detecting and monitoring geneti- 
cally modified organisms in agriculture and food, while also playing a 
key part in the implementation of the controversial new Registration, 
Evaluation, Authorisation and Restriction of Chemical substances 
(REACH) regulations. 

In the process of becoming customer orientated, however, the JRC 
lost its independent research activities. Does this matter, given that they 
were not highly regarded anyway? 

Yes, it matters. The role of science in policy has become ever more 
important. Crucial decisions about climate, energy and the like rely on 
the highest quality of scientific information. JRC scientists need to be 
able to interact intelligently with cutting-edge research in the academic 
community, and to extend it appropriately. 

But high-level scientists need an appropriate intellectual environ- 
ment to work in, and this is where the JRC has had its hands tied. Right 
now, JRC researchers are running to stand still. As new tasks stream 
in from their demanding customers, research projects enthusiasti- 


cally started during a previous task are dropped — even if they might 
have had long-term value for policy. The scientists tend not to publish 
much in academic journals, as this has not been considered one of 
their ‘deliverables: Most JRC science is written up in internal reports. 
These are not ideal conditions for attracting and keeping top, ambi- 
tious scientists. 

This is why the King report's proposal that a small proportion of the 
budget be directed into exploratory work is welcome, as are two other 
radical proposals, to some extent related. 

First, the JRC needs to develop its own long-term strategies so that 
it can plan more rationally. Although it has 2,750 staff, the scatter-gun 
tasks required of it spreads them too thinly. So the strategies should 
also set out criteria for which tasks to accept so that critical mass can 
be achieved from limited human resources. 

Second, the JRC needs more political responsibility. Given its accu- 
mulated knowledge and experience, and its close contact with the sci- 
entific community, it is well placed to see problems coming. Structures 
should be set up to allow it to advise, not just serve, its customers. 

In short, the JRC needs to be allowed to grow up. Europe's council, 
parliament and commission now need to shed their distrust and make 
the JRC an even more useful institution than it has already become. m 


Choosing a world 


Titan is a slightly more appealing lunar target than 
Europa for the next outer-planets mission. 


n just five decades, planetary spacecraft have provided an extraor- 
dinary wealth of discoveries: the oddly young surface of Venus, 
the ancient landscapes of Mars, the volcanoes of Io, the geysers of 
Triton, the lakes of Titan, the ocean of Europa. But two of the most 
sought-after things have not been discovered. One is life. The other 
is how to explore space cheaply. 

The eye-watering expense of major missions imposes on the plan- 
etary enterprise a glacial tempo that is at odds with the drama of its 
discoveries. This is particularly so the further they get from Earth. 
The United States is currently deciding whether it should send its next 
‘flagship’ outer-planets mission to Europa, an ice-covered moon of 
Jupiter, or Titan, the thick-atmosphered moon of Saturn (see page 
366). The decision — in which Europe, as a partner in the mission, 
also has a stake — will shape the lives and interests of researchers 
through to the 2040s. But with total costs of more than US$3 billion, 
only one mission can be afforded. 

Until recently, Europa was widely tipped as the favourite. Its ice- 
covered ocean was one of the two most promising sites the Solar 
System offers for life beyond Earth. Indeed, it might have greater 
attractions than the other prime location, the subsurface of Mars; 
given that impacts can send meteorites flying between Mars and 
Earth, there is a real chance that any Martian life found would be 
related to that on Earth. Europa, which is much more isolated, is a 
better bet for an independent origin of life. 

Unfortunately, to do full justice to such possibilities would 
require measurements from Europa’ surface, or even below it — 
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measurements that technology cannot yet offer. Instead, the proposed 
Europa mission would be an orbiter restricted to producing images in 
various wavelengths, including those from ice-penetrating radar. The 
data would bring some closure to arguments about the thickness of the 
crust (see page 384) and might identify places where the subsurface 
can be accessed most easily. But in the end, an orbiter would be only 
a precursor to the next act — a landing mission, perhaps with drilling 
capability — the dénouement of which may be half a century away. 

A Titan mission, by contrast, would be unlikely to encounter life, 
but would be much more intimately involved with its environment. 
It would include a Titan orbiter with radar and other instruments 
to map the moon far more thoroughly : 
than the Cassini mission's ongoing fly-bys “it is hard to do 
can; a European lander designed to float Science when new 
on one of the hydrocarbon lakes; anda datacomeina 
hot-air balloon (or more accurately, a splurge every few 
slightly-less-cold-air balloon) that would dasades® 
drift around the moon studying its hazy : 
atmosphere and rich landscapes of channels, lakes and dunes. 

The Titan mission would also offer technological firsts, including 
the floating lander and that undeniably romantic hot-air balloon. 
Although floating on a lake is a skill for which there seems to be no 
further call in this Solar System, ballooning could be used to explore 
many other planets and their moons, and developing this capability 
could be seen as an investment in that future. 

It is hard to do science when new data come in a splurge every few 
decades. Gifted people go elsewhere, those who remain get locked 
in irresolvable debate, hypotheses calcify into dogma. But at the 
moment there seems to be little that can be done to break the long, 
drawn-out rhythm of feast and famine. And if we must choose, then 
floating below the rings of Saturn seems marginally more appetizing 
than above the hard ice of Europa. 2 
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Squid suckers scrutinized 


Adv. Mater. doi:10.1002/adma.200801197 (2009) 
The fearsome suckers with which the 
Humboldt squid (Dosidicus gigas) clutches 
its prey are lined with toothed rings. David 
Kisailus at the University of California, 
Riverside, Henrik Birkedal at the University 
of Aarhus in Denmark and their colleagues 
have peered into the jaws of these suckers to 
find out what makes them so strong. 

The sucker rings have sharp teeth made 
from parallel tubes that are hollow near the 
circular base and filled at the sharp end. This 
channel-like structure directly affects the 
mechanical properties of the sucker rings, 
the authors say, adding extra grip and shear 
strength for terrorising prey. 

Surprisingly, the sucker rings don’t 
contain chitin, usually present in the shells of 
crustaceans. The sucker rings’ major amino 
acids are glycine, tyrosine and histidine. The 
authors propose that hydrogen bonds between 
histidine residues give the rings their rigidity. 


Blockade boost 


Nature Phys. doi:10.1038/nphys1183 and 
doi:10.1038/nphys1178 (2009) 
The groups of Antoine Browaeys of the 
University of Paris-South in Palaiseau and 
Mark Saffman of the University of Wisconsin, 
Madison, have independently demonstrated 
a simple set-up that could form the basis of 
a quantum logic gate, a system that has the 
advantage of having components widely 
separated enough to be easily addressable one 
at a time. 

Their Rydberg blockades hold rubidium 
atoms in optical traps several micrometres 
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apart. Lasers are used to excite an electron 
in one atom to an extent that blocks its 
neighbour from achieving a similar excited 
state. The next step will be to demonstrate 
useful entanglement by creating a working 
logic gate with the blockade. 


Herd of hearing 


Proc. R. Soc. B doi:10.1098/rspb.2008.1390 (2009) 
If you want to know what Archaeopteryx 
sounded like, a first step is to work out what it 
could hear. 

Working with specimens of 59 extant 
species of reptile and bird (including the 
barn owl, skull pictured left), Paul Barrett 
of London’s Natural History Museum and 
his colleagues measured the length of a duct 
found in the bony part of the inner ear. They 
show that this length correlates with the 
hearing range and best hearing frequency of 
the animals, and can also be used as a guide 
to the complexity of their calls. 

This relationship allows the first 
quantitative assessment of the hearing ability 
of extinct species: Archaeopteryx seems to 
have been a fair match for the emu in this 
regard. More data may allow inferences to be 
made about the bird’s vocalizations, and thus 
reveal what sort of social structure it lived in. 
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Getting their morph on 


Science 323, 362-365 (2009) 

Some precipitates take regular, sinuous 
and oddly lifelike forms, and are known 
as biomorphs. Juan Manuel Garcia-Ruiz 
of the University of Granada, Spain, and 
his colleagues describe an intriguing 
chemical feedback mechanism that 
creates the microcrystals responsible. 
In biomorphs built from barium 
carbonate crystals, the formation 

of these elongated crystals has the 
effect of locally reducing pH, which 
allows the precipitation of silica onto 
the crystals, halting their growth and 
defining their shape. The properties of 
these microcrystals, which the authors 
observed using time-lapse video 
microscopy and electron microscopy, 
are responsible for the smooth curves 
and furled edges of the biomorphs. 

The work opens the way for new 
approaches to the synthesis of 
biological and biomimetic materials, 
and to the exclusion of false positives 
when looking for life-like forms in poorly 
characterized environments. 


Serotonin and social anxiety 


PLoS One 4, e4156 (2009) 

Certain versions of a key gene for the brain 
regulator serotonin disproportionately 
predispose those with a history of childhood 
abuse to depression and alcoholism. 

Rhesus macaques and humans have 
similar variation in the serotonin transporter 
gene. Work by Karli Watson of Duke 
University in Durham, North Carolina 
and two colleagues shows that the animals 
show similar behaviour if they possess one 
‘short version of the gene. These macaques 
demonstrated the simian equivalent of social 
anxiety by avoiding looking at pictures of 
faces, and particularly avoiding looking 
them in the eye. 


Martian methane 


Science 10.1126/science.1165243 (2009) 

The detection of methane in the atmosphere 
of Mars five years ago was a surprise, 
because the gas is not stable under Martian 
conditions. Michael Mumma at NASA’ 
Goddard Space Flight Center in Greenbelt, 
Maryland, one of those who made the initial 
reports, and his colleagues now provide 
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details about how much methane can be 
detected on the planet and where. 

Using data from two high-resolution 
spectrometers on Hawaii, they report 
evidence for periodic, localized plumes of 
methane in the highland region known as 
Arabia Terra. By comparing measurements 
made at different times they estimate that 
the gas has an atmospheric lifetime of at 
most a few Earth years. The mechanism 
of its removal, like the mechanism of its 
production, is currently unknown. 


ENTOMOLOGY 


Hammers of the wasps 


Biol. J. Linn. Soc. 96, 82-102 (2009) 

A number of parasitic wasp species have 
independently evolved echolocation 
techniques to find host insects deep within 
trees. 

Several species of parasitic wasp attack 
beetle larvae living inside wood, leading 
researchers to wonder how they find their 
prey. To this end, Nina Laurenne at the 
Museum of Natural History in Helsinki and 
her colleagues have surveyed the hammer- 


shaped antennal tips that these species whack 


against the trees. This hammering allows the 
wasps to locate the regions where they are 
wont to find their prey. 

A phylogenetic analysis conducted by the 
researchers suggests that these hammers are 
not a one-off innovation and have appeared 
and disappeared during the course of 


evolution, seemingly in response to the needs 


of wasp species moving into this niche. 


NEUROBIOLOGY 


Scent slides away 


Neuron 61, 57-79 (2009) 

To adjust its behaviour to suit ever-changing 
environments, an animal's sensory neurons 
must not only be able to respond to a change 
— such as a new smell — but also to gauge 
when the novelty has worn off, and the 
response is no longer appropriate. 

The olfactory neurons of the tiny worm 
Caenorhabditis elegans achieve this by 
synthesising a key adaptation protein at 
just the right place and time: in the neuron’s 
sensory cilia when the odour is first 
encountered. 

Noelle LEtoile from the University of 
California, Davis, and her colleagues have 
now shown that this synthesis is increased 
by RNA-binding proteins known as PUF — 
which is surprising because PUF proteins 
suppress protein synthesis during the 
adaptation of neurons that occurs in the 
course of development. 


CLIMATE 
De-fogged 


Nature Geosci. 10.1038/ngeo414 (2009) 

During the past 30 years, the skies of Europe 
have become clearer. Robert Vautard of the 
Laboratory for Climatic and Environmental 
Science (LSCE) in Gif-sur-Yvette, France, 


and his colleagues studied records of visibility 


at weather stations across the continent. 
They found that low-visibility conditions 


such as fog, mist and haze have declined by as 


much as 50% over the period. The pattern of 


improvement is correlated with local declines 


in sulphur dioxide emissions, suggesting a 
role for pollution control. 

This brightening of the skies, the 
researchers say, could have contributed to 
Europe's reported daytime warming during 
that time by 10-20%, with a particularly 
marked effect in eastern Europe. 


EVOLUTION 
Run rabbit run 


Proc. Nat! Acad. Sci. USA 106, 952-954 (2009) 
Predation by humans drives changes in 
exploited prey much faster than other 
evolutionary pressures do. 

Previous research has shown that 
commercial fishers and trophy hunters 
can mould traits such as average size at 
reproductive age in wild populations, but 
no single study has revealed the pace at 
which these changes generally take place. In 
a meta-analysis of work on the morphology 
and life histories of 29 species, including fish, 
mammals and plants, Chris Darimont of the 
University of California, Santa Cruz and his 
colleagues found that changes in human- 
harvested systems occurred more than 300% 
faster than in natural systems, and 50% faster 
than in systems affected by other human 
influences, such as pollution. 

The authors suggest that human predation 


works so quickly because it is often felt by large 


proportions of the adults in populations. 
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Paul Knoepfler 
University of California, 
Davis 


A cell biologist looks at the risk 
and promise of a new insight into 
stem cells and cancer. 


I study both stem and tumour 
cells, and am fascinated by their 
close relationship. Both exhibit 
pluripotency — the capacity to 
develop into any cell type — and 
the ability to cause cancer. Even 
some apparently normal stem 
cells can cause tumours, whereas 
others, sometimes from the same 
culture, lack this power. It seems 
that not all stem cells are created 
equal — even inthe same dish. 

A recent paper from Mickie 
Bhatia’s group (T. E. Werbowetski- 
Ogilvie et al. Nature Biotechnol. 27, 
91-97; 2009) is the first to directly 
address this heterogeneity in 
human embryonic stem cell (ESC) 
cultures. The team found that 
individual human ESC lines contain 
significant subpopulations that 
vary ina number of ways, including 
in tumorigenicity. 

Variant human ESC lines were 
about 20 times more tumorigenic 
than the cultures they had been 
derived from and showed small 
changes in chromosome structure. 
These could be identified by 
array-based comparative genomic 
hybridization (aCGH), but were 
not detectable by standard 
karyotyping. Thus for ‘normal’ stem 
cells being considered for use in 
regenerative medicine, karyotyping 
is not enough. Screening should 
also include aCGH, and perhaps 
an analysis of gene-expression 
patterns. 

This previously covert diversity 
has implications for both tumour 
biology and medical applications 
involving stem cells. It may shed 
light on the ‘locked in’ self-renewal 
that is emerging as an important 
feature of many sorts of tumour 
and tumour stem cell. 

The heterogeneity of human ESC 
cultures represents an additional 
hurdle in terms of producing safe 
stem-cell-based transplants. At the 
same time, it may offer a valuable 
bonus: the chance to purify variant 
human ESC sub-lines that are less 
tumorigenic. 


Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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NEWS 


Not so sunny after all 


Manufacturers in the solar-energy industry are downsizing and scaling 
back their once-ambitious plans. Katharine Sanderson reports. 


Lay-offs, stalled projects and grim 
financial forecasts are affecting the 
solar-energy industry, which until 
recently had been growing with little 


end in sight. 


In November 2008, BP Solar 
announced that it would be closing 
a photovoltaics factory in Sydney, 


RECESSION 
WATCH 


let go. 


funding rounds. “We suspended con- 
struction in November as it became 
obvious that access to capital was not 
available,” says spokesman Alan Bern- 
heimer. Nearly half the company’s 
workforce, or 300 employees, will be 


Some legislative incentives that 


Australia, so that it could focus its opera- 
tions in countries with lower manufacturing 
costs. Since then, GT Solar, of Merrimack, 
New Hampshire, let go 25 of its 300 workers; 
Day4Energy in Burnaby, Canada, slashed one- 
third of its staff, some 95 employees; and even 
Suntech, one of China’s largest solar compa- 
nies, in Wuxi, Jiangsu province, shed 800 jobs, 
or 10% of its work force. 

One of the most dramatic cutbacks has 
come at OptiSolar of Hayward, California. 
The company has been planning to open the 
largest photovoltaics power plant in the world, 
a 550-megawatt facility in San Luis Obispo 
county in California, by 2013. OptiSolar says 
it still intends to build the plant even though 
it has been unable to raise money in its latest 


could help the industry are on the horizon. 
Last October, the US Congress extended for 
another eight years the 30% tax 
credit for those who invest in 
renewable-energy projects. But 
many investment firms aren't 
making enough money to take 
advantage of the credit. “This 
is probably the biggest problem 
for US industry,’ says Jenny Chase, a London- 
based analyst for New Energy Finance. The 
Solar Energy Industries Association in Wash- 
ington DC has called for the incentives to be 
made refundable so that investors can claim 
them even if they are not making money. 

The economic stimulus package proposed 
by the US House of Representatives last week 


“This was always 
likely to be the 
year when supply 
overtook demand.” 


(see page 364) doesn't include such a plan, but 
it does contain a US$8-billion loan guarantee 
programme for renewable energy technologies. 
If passed, this proposal could help OptiSolar, 
which is applying now for loan guarantees 
from the government. “We seem to be just what 
they're talking about,” says Bernheimer. Still, 
even Monique Hanis, director of communica- 
tions for the industry organization, says that 
“these are just proposals. There's a lot of work 
to be done.” 

Current tax incentives also don’t cover solar- 
cell manufacturers, notes B. J. Stanbery, chief 
executive of HelioVolt, a company in Austin, 
Texas, that manufactures copper-indium- 
gallium-selenide solar cells. Helio Volt, which 
is just about to bring its first manufacturing 
plant online, has also made 
some employees redundant, 
but won't say how many. Stan- 
bery says that the redundancies 
were part of the process of shift- 
ing from research and develop- 
ment to manufacturing, rather 
than because of the economic crisis. “Frankly, 
it had more to do with adjusting the skills mix 
in the company,” he says. 

Stanbery predicts that his company will 
survive as it is young and small — it expects 
to generate 20 megawatts this year from its 
production facility in Austin. “I don’t think 
it’s going to be a really tough year for us,” he 


Translational research in Berlin hits a roadblock 


A historic biomedical research 
campus in former East Germany 
is retrenching after a prolonged 
political attack. Unproven 
accusations alleged that 

€15 million (US$20 million) of 
state money had flowed illegally 
into the private Helios hospital 
in the Berlin suburb of Buch. 
Clinical researchers there now 
find themselves struggling to 
rebuild collaborations with basic 
scientists. 

The dispute concerns a contract 
forged in 2001 between a national 
research centre, the Max Delbriick 
Center for Molecular Medicine, 
and two hospitals: Helios, which 
is private, and the Charité, which 
is Berlin's teaching hospital. The 
aim was to promote translational 
research. But the hospitals 
terminated the contract in 
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December 2008 “because we felt 
that the scandalizing would never 
stop unless we did", says Karl Max 
Einhaupl, head of Charité. Einhaupl 
says that dividing funds between 
research and patient care is always 
complicated, but that he doesn't 
think any state money was used 
inappropriately. 

Buch developed over the past 
century as a state-of-the-art 
biomedical and clinical campus. 
After reunification, the former West 
German research system absorbed 
the basic-research institute but 
not its clinical facilities, which were 
eventually privatized. 

Keen to improve the parlous 
state of translational clinical 
research in Germany — something 
that East Germany had done well 
in Buch — the German government 
helped to facilitate an agreement 


with the private sector. Clinicians 
from Charité’'s medical school were 
appointed to head clinics at Helios, 
and their salaries were shared 


between the hospitals. This gave 
clinicians with research interests 
convenient access to the laboratory 
facilities at Buch. 


The Buch biomedical campus in northeast Berlin is home to large lab facilities. 
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INTERVIEW: PAUL CHU 
Hong Kong inaugurates 
Institute for Advanced 
Study. 
www.nature.com/news 
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Solar panels are now being made faster than they can be sold. 


says. For others in the solar industry at large, 
he predicts, “it is going to be brutal”. 

Solar’s rapid growth in recent years has 
coloured expectations for the field, says Ken 
Zweibel, director of the Institute for the Anal- 
ysis of Solar Energy at George Washington 


ona 40-50% growth rate for ten years now, he 
says. “What we're losing is that growth rate.” 
This year was always going to be tough for 
solar, says Chase. In Europe in particular, four 
years of generous subsidies have encouraged 
new players to join the fray, leading to a surge of 


likely to be the year when supply overtook 
demand, she says. In Spain, the government's 
incentive scheme was so popular that roughly 
3 gigawatts of solar-power capacity were con- 
nected to the grid in 2008, and authorities are 
investigating reports of fraudulent hook-ups. 
The Spanish government has since capped this 
year’s production subsidies at 500 megawatts. 

In Bitterfeld- Wolfen, Germany, solar-cell 
manufacturer Q-Cells closed its facilities tem- 
porarily over the Christmas—New Year period 
to save money. Q-Cells has also put on hold 
plans to build a silicon wafer and ingot plant 
in Malaysia. “We will have to focus on our core 
business — increasing cell production,’ says 
says spokesman Stefan Dietrich. “There’s not 
much money around” 

In December, Q-Cells reduced its forecast 
net income for the 2008 financial year from 
€215 million (US$286 million) to €185 mil- 
lion. Similarly, the Chinese company LDK 
Solar, in Xinyu, Jiangxi province, cut its 2008 
forecast from a maximum in revenue of $565 
million to amaximum of $435 million. 

One thing is sure, says Chase: the prices of 
solar modules will plummet in 2009 because 
of the oversupply. But low prices could mean 
that the developing world gets better access to 
solar power, or attracts investors to companies 
that had been flagging. “I think the return on 
solar will go up,” she says. It’s just that some 


University in Washington DC. “Solar has been 


solar modules on the market. “This was always 


companies won't make it that long. | 


But Germany has always been 
uncomfortable with public-private 
partnerships, particularly within 
the cash-strapped health system. 
For more than a year, politicians 
had been piling pressure on 
the hospitals, suggesting that 
public money was seeping into 
patient care under the guise of 
research. Two investigations 
by the consultancy firm 
PricewaterhouseCoopers failed 
to disentangle the payments, but 
concluded that there was “no badly 
intentioned, deliberate cross- 
transfer of funds”. 

Now that the contract has been 
terminated, clinic heads at Helios 
have had to decide whether to stay 
with the hospital and lose their 
academic status, or move into the 
city at Charité and lose convenient 
access to the Buch laboratory 
facilities. 

In the future, the Max Delbriick 
Center will collaborate with Helios 
on a project-by-project basis with 


much paperwork to specify who 
pays for what. 

“The flow of finance will be 
clearer this way, but [the new 
bureaucracy] doesn't make our 
lives particularly pleasant,” says 
Frederick Luft, a nephrologist and 
hypertension expert who decided 
to stay with Helios so that he 
could stay in Buch. “It makes a big 
difference to be able to walk 10 
minutes between the hospital and 
the lab,” he says. 

Many of Luft's colleagues also 
believe that the distance between 
suburban Buch and the downtown 
Charité clinics, which can take up to 
an hour on public transport, makes 
collaborations less attractive. 

“Translational research works 
best when clinicians and basic 
scientists meet on a daily basis," 
says Detlev Ganten, founder of 
the Max Delbriick Center, former 
Charité chief and architect of 
the public-private collaboration 
contract in 2001. Still, he says, 


“Buch is not as disconnected as it 
was, and it will be possible to get 
things to work again.” 

Another part of the effort to 
promote translational research 
at Buch has also 
run into trouble. 


“Translational 


Schavan is scheduled to open 

an ambitious new €10-million 
magnetic resonance imaging 
facility at the Max Delbriick Center. 
It includes a 7-tesla imager for use 
in humans, anda 
9.4-tesla imager for 


A publicly funded research works best _ animals. 
Experimental and when clinicians and Aside from their 
Clinical Research é : more conventional 
Center had been basic scientists meet use to look inside 


planned to be built 
on Helios land in 
2001, but will now be built a few 
hundred metres away, next to the 
Max Delbriick Center, instead. 
The change will delay its planned 
2011 opening by at least a year. 
Collaborations between the Max 
Delbriick Center and the Charité, 
both of which are publicly funded, 
will now intensify and be organized 
through the Experimental and 
Clinical Research Center. 

This week, the Buch campus 
sees at least one piece of good 
news. Research minister Annette 
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ona daily basis." 


the brain, the 
machines will 

also be used to develop methods 
for imaging the beating heart, 

a challenge very few groups 
around the world are attempting. 
The facility is meant as another 
step in Germany's embrace of 
translational research in medicine. 
“Imaging is an important bridge 
between basic research and 
medicine," says pharmacologist 
Walter Rosenthal, scientific 
director of the Max Delbriick 
Center. | 
Alison Abbott 
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Cash boost for US science 


Researchers in line for $13-billion windfall. 


After getting their first glimpse of the massive 
financial stimulus bill last week, US researchers 
are scrambling to work out how to get some of 
the billions of dollars proposed for science and 
technology into their laboratories. 

On 15 January, the House of Representatives 
released details of its proposed US$825-billion 
economic stimulus bill. Along with other 
spending initiatives and a raft of tax cuts, the 
blueprint includes new, one-off funding for 
federal research and development that totals 
more than $13 billion. 

Among the big winners is the National 
Science Foundation (NSF), which would 
receive an additional $3 billion — half of its 
annual budget — of which $2 billion would go 
directly to research grants. The National Insti- 
tutes of Health (NIH) would receive $3.5 bil- 
lion, of which $1.5 billion would be for research 
at NIH centres over two years; $1.5 billion for 
building grants at university research facili- 
ties; and $500 million for construction on the 
NIH campus in Bethesda, Maryland. The 


Department of Energy's Office of Science would 
receive $2 billion, which includes $400 million 
to kick-start the Advanced Research Project 
Agency-Energy, which is meant to fund high- 
risk research in innovative energy ideas. 

As expected, other clean-energy initiatives 
also reaped billions of dollars. Some $4.5 bil- 
lion would go to efforts to develop a smart 
electricity grid; $8 billion would go to loan 
guarantees for renewable-energy technologies 
(see page 362); and $2.4 billion would go to 
projects in carbon capture and sequestration 
at fossil-fuel-burning plants. 

Many research advocates had pushed 
Congress and the incoming administration 
of Barack Obama for such investments, and 
predictably applauded the proposal (see 
Nature 457, 240-241; 2009). 

Robert Berdahl, president of the Association 
of American Universities in Washington DC, 
said the bill represents a solid endorsement 
of the scientific community's argument that 
investing in research and education provides 


House Appropriations 
chairman David Obey 
unveiled the winners in the 
economic stimulus bill. 


Ebola outbreak has experts rooting for answers 


When the Ebola Reston virus was 
discovered in pigs in the Philippines last 
year, it marked the virus’s first known foray 
outside primates, and raised fears ofa 
potential threat to human health. 

Last week, a joint mission of 22 
international health and veterinary experts 
returned from investigating the outbreak 
with more questions than answers about the 
virus’s pathology and epidemiology. 

The Ebola Reston virus was first 
discovered, in 1989, in crab-eating 
macaques imported to the United States 
from the Philippines. Since then, the virus 
has killed most infected monkeys, yet had 
no effect on the 25 people that it infected 
— unlike three of the four other strains of 
Ebola, which kill between 25% and 90% of 
the humans they infect. 

Because few people come into close 
contact with primates in the Philippines, the 
risk of catching Ebola Reston in this way is 
relatively low. By contrast, the appearance of 
the virus in an important livestock species 
was unexpected and worrying, says Pierre 
Rollin, an Ebola expert at the US Centers 
for Disease Control and Prevention (CDC) 
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in Atlanta, Georgia, who was part of the 
mission to the Philippines. “We never 
thought that pigs could be infected,” he says. 
Once inside the pig, it may be possible 
for the virus to mutate into a version that 
is deadly to humans, as the avian influenza 
virus is thought to have done. “And we still 
don’t know what it might do to someone 
who is immunocompromised by HIV or by 
drugs,” Rollin adds. 
But there seems to be little threat to 


human health from the current form of the 


Is an Ebola-virus subtype killing pigs? 
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virus. It is destroyed by cooking, and there 
is no evidence of symptoms in pig handlers, 
who will soon be tested to find out if they 
have developed antibodies to the virus. 

The investigation into the Ebola Reston 
infections began after farmers in the 
Philippines reported high mortality rates 
in their pigs in 2008. In September, samples 
from 28 dead pigs were sent to the Plum 
Island Animal Disease Center in New 
York, where researchers found evidence of 
the porcine reproductive and respiratory 
syndrome virus, also known as blue-ear pig 
disease, which has seen many outbreaks 
in Asia in recent years. But in six of the 
samples they also found Ebola Reston. 

This virulent, biosafety-level-4 pathogen 
requires special laboratory facilities, so the 
pig samples were rushed to the CDC labs in 
Atlanta for further analysis. 

Despite the presence of other diseases 
in the samples — including swine fever, 
and the porcine circovirus type II — Rollin 
thinks that Ebola Reston is to blame for the 
pigs deaths because histological samples 
showed that the virus had pervaded the 
spleen, similar to its mode of attack in 


HAVE YOUR SAY 
Comment on any of our 
News stories, online. 
www.nature.com/news 


jobs while laying the foundation for a cleaner, 
more competitive economy. The trick, he says, 
will be getting the bill through Congress and 
then sustaining funding into the future. 

“We hope that this gets built into the base 
and that it is essentially front-loading some 
of the increases that are planned for 2010 and 
beyond,” says Berdahl. Obama is scheduled 
to present his proposed budget for fiscal year 
2010 in early February. 

But some have questioned 
whether the one-time infusion 
of cash will matter much to 
agencies whose budgets have 
flatlined or been lower than 
expected in recent years. Elias 
Zerhouni, former director of the 
NIH, says the stimulus package does not focus 
enough on sustaining scientists, concentrating 
instead on the facilities that house them. The 
current proposal “is too timid and not strategic 
enough in addressing the long term’, he says. “It’s 
short-term wise but long-term ineffective.” 

Democratic congressional leaders, spear- 
headed by House Speaker Nancy Pelosi of 
California, developed the bill in consultation 
with Obama’ transition team. Although the 
legislation is sure to change as it moves through 


“Researchers out 
there could actually 
spend alot of money 
wisely and quickly if it 
were made available.” 


the House and the Senate, the fact that it has 
the tacit approval of both Pelosi and Obama 
means that it is likely to survive in some form. 
Republicans, however, were not consulted. 
Most of the stimulus spending would 
extend over two years, although money for 
peer-reviewed grants must be awarded within 
120 days to ensure it is spent quickly. That 
could mean that agencies use the money to 
fund peer-reviewed grants 
that previously scored highly 
but were not funded because 
of a lack of money at the time. 
At the NSE, individual direct- 
orates are likely to be allowed to 
determine how they will spend 
the $2-billion windfall. Other 
chunks have been designated for specific pro- 
grammes: $400 million, for instance, will go to 
the major research equipment and facilities pro- 
gramme, which includes large projects that must 
be approved by the National Science Board. 
Another $500 million will go to instrumenta- 
tion, including modernization and retrofitting. 
At NASA, the $400 million targeted for the 
science office includes $250 million to acceler- 
ate Earth-sciences projects recommended in 
a recent prioritization survey by the National 


Academies. Among other things, that would 
pay for a climate sensor measuring total solar 
irradiance to be put back on the next genera- 
tion of US weather satellites; it had previously 
been removed to save money. 

Bart Gordon (Democrat, Tennessee), chair- 
man of the House Science Committee, called 
the bill a long-delayed down payment on the 
American Competitiveness Initiative, which 
seeks to boost research and education in mathe- 
matics, engineering and the physical sciences. 

Neal Lane, a professor at Rice University in 
Houston, Texas, and a former science adviser to 
President Bill Clinton, says agencies such as the 
NSF and the Department of Energy will prob- 
ably be able to absorb and spend the additional 
funding quickly, in part because they have been 
planning for increased funding under the com- 
petitiveness initiative. The NSF in particular has 
been faced with a backlog of requests and peer- 
reviewed proposals for facilities and construc- 
tion of major research equipment. “Those seem 
like reasonable places to make a quick and early 
investment, he says. “The researchers out there 
could actually spend a lot of money wisely and 
quickly if it were made available tothem” 
Jeff Tollefson; additional reporting by 
Meredith Wadman and Rich Monastersky. 


monkeys. Further pathology tests are due 
to begin in spring at the Australian Animal 
Health Laboratory in Geelong, Victoria. 

The infected pigs came from several farms 
on the island of Luzon, and on 13 January, 
health officials collected blood and tissue 
samples from hundreds of apparently 
healthy pigs there. Although Rollin does 
not expect to find the virus itself in these 
samples, the pigs may carry antibodies that 
should indicate an approximate mortality 
rate associated with exposure. 

Rollin suspects that, as is the case with 
monkeys, the infections resulted from 
contact with a reservoir of the virus, rather 
than spreading from animal to animal. In 
2005, outbreaks of human Ebola in Gabon 
and the Republic of the Congo were traced 
back to colonies of bats (E. M. Leroy et al. 
Nature 438, 575-576; 2005). “It’s almost 
certainly the case [in the Philippines],” 
says Rollin. 

The virus is likely to be spread by bat 
droppings falling into the pigs’ feed, and 
the threat of infection could be reduced by 
moving fruit trees, where the bats roost, away 
from pig farms, or by putting roofs on pig 
enclosures. “We cart exterminate it, we just 
have to learn how to avoid it; says Rollin. ™ 
David Cyranoski 


GRAPHIC DETAIL 


Prices plummet on carbon market 


The price of European Union 
(EU) allowances for carbon AG 150 
dioxide emissions has = 
reached an all-time low, hit by SO S 
falling oil and gas prices, and ee s 
expectations that economic s 2 ye o 
recession will lead to reduced 55 8 
energy demand. Ft of a 
Under the EU's mandatory ES mM § 
emission trading system — set a 
up in 2005 and still by far the I 
largest such scheme in the 


world — power plants and 
other CO,-intensive industries 
can buy emission allowances 


that allow them to exceed their 
government-allocated CO, 
caps. In 2008, the equivalent 
of almost 5 billion tonnes of 
CO, was traded on the global 
market, an 83% rise on 2007. 
At close of trading on 
19 January, allowances to 
emit one extra tonne of 
CO, in 2009 were selling at 
just €11.65 (US$15.32) on 
European exchanges that 


trade carbon, such as the 
European Energy Exchange 
(EEX) in Leipzig, Germany 
(see graph). The price of 
allowances saw a recent peak 
of above €30 in July 2008, 
but falling oil and gas prices 
have encouraged electricity 
generators to switch from 
burning coal to cleaner 
natural gas, reducing demand 
for emission allowances. 


Given the dire economic 
outlook, analysts believe that 
it will be difficult to stop the 
market's negative trend in the 
next few months. 

The EU's emission trading 
system is due to be reformed 
in 2013 to reduce the amount 
of free allowances handed 
out to companies (see Nature 
456, 847; 2008). 

Quirin Schiermeier 
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Which moon to shoot for? 


Planetary scientists have a rare chance to pick the destination for their next big mission. 
But will it be Titan or Europa? Eric Hand reports. 


n 2005, Jonathan Lunine, a planetary 
scientist at the University of Arizona in 
Tucson, was one of the first dozen people 
to glimpse the surface of Saturn’s moon Titan. 
As the Huygens probe fell, a bizarre landscape 
emerged: an icy world of hills and channels 
carved by liquid methane. “And I had two 
emotions, one right after the other,” he says. 
“The first was elation, astonished elation! And 
the second one, almost immediately, was sad- 
ness. Because I knew that I wouldnt get to see 
this again in my scientific lifetime.” 

But Lunine might just be that lucky — if 
NASA and the European Space Agency (ESA) 
choose to return to Titan two decades hence. By 
the end of this month, agency 


have been the brightly coloured horse. In 
1995, the Galileo probe began an 8-year tour 
of Jupiter’s system, during which it snapped 
the first close-ups of Europa’s scarred surface. 
Analysis of a magnetic anomaly soon revealed 
the moon’s most astonishing feature: that egg- 
shell of ice is thought to enclose a warm, salty 
ocean. Scientists immediately clamoured to 
return. The JPL began a Europa Orbiter mis- 
sion study in 1999; Europa was ranked a top 
priority in an important 2002 community list; 
even Congress told NASA, in 2005, to begin a 

Europa mission. 
The need to go back is still great, says 
Pappalardo. Galileo made only 11 passes of 
Europa and was hampered 


officials plan to pick a desti- “Are you going to pick by an antenna that failed to 
nation for a massive mission, the brightly coloured fully deploy, limiting its data 
costing nearly US$4 billion, to h : transmissions. An orbiter 
be launched around 2020 for MOFS€, OF are you going equipped with new instru- 
the distant reaches ofthe Solar to ask, ‘Is that horse ments — in particular, an 


System. The battle pits Titan, 
which recent discoveries have 
made the cool new kid on the block, against 
Jupiter’s moon Europa, which has long sat atop 
community wish lists. 

The impending decision — a big deal for 
a science community that typically sees its 
projects come to fruition just once a genera- 
tion — has planetary researchers taking sides. 
“T think it’s going to come down to a matter 
of the science,” says Lunine, co-chair of a 
study pushing for a visit to Titan. His mission, 
described in glossy brochures and snazzy com- 
puter simulations, includes a hot-air balloon 
straight out ofa Jules Verne novel, which would 
drift in Titan’s cold breeze. 

On the other side are the Europa support- 
ers, who argue that a decade of mission-design 
work should carry the day. They envisage a 
NASA mission to Europa combined with an 
ESA mission to Ganymede, another Jovian 
moon with an intriguing magnetic field. Bob 
Pappalardo, a planetary scientist at the Jet 
Propulsion Laboratory (JPL) in Pasadena, 
California, has worked on five Europa mis- 
sion studies over a decade; the Titan mission 
concepts, he says, just recently got going. “Are 
you going to pick the cool, brightly coloured 
horse, or are you going to ask, ‘Is that horse 
ready to run?” he says. “These reviews should 
be looking at the teeth and gums and seeing 
which one is ready to go.” 

Ten years ago, a mission to Europa would 
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ready to run?’” 


ice-penetrating radar — 
would end debate over the 
thickness of the ice shell, and reveal the depth 
of its fissures and cracks. Astrobiologists want 
to know how isolated any ocean might be from 
the surface, where withering radiation would 
make life tough but could also lead to chemical 
exchanges that nourish life. 


Safe from harm 

Radiation would damage not only life but 
spacecraft electronics and, over the past 
decade, engineers have developed technol- 
ogy to combat it. The earlier Europa studies 
funded research in radiation-hardened, or 
rad-hard, electronics that are now used on 
military satellites and NASA spacecraft, mak- 
ing the components cheaper. Where rad-hard 


A trip to Europa would mean building an orbiter... 
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components still don't exist or are too costly, 
engineers will nestle the orbiter’s 11 proposed 
instruments behind metallic shields of tanta- 
lum and tungsten. 

Still, 20% to 30% of the mission’s cost is for 
radiation protection, according to Europa 
study leader Karla Clark of the JPL. The 
Europa orbiter must fit within a NASA cost 
envelope of $2.9 billion, while the ESA con- 
tribution, the Ganymede orbiter, must cost 
less than €650 million ($860 million). At 
those prices, a landing element, or even an 
ice-drilling cryobot, is impossible. But Pap- 
palardo argues that sending a Europa orbiter 
now could pave the way for a future lander — 
by scouting for the smooth pavement. “We're 
ready to go to Europa now, he says, “and we'll 
be ready to do Titan next” 

Jupiter, then Saturn, then Europa, then Titan 
— this notion of ordered ‘turns’ doesn't fly for 
Ralph Lorenz, a planetary scientist at the Johns 
Hopkins University Applied Physics Laboratory 
in Laurel, Maryland. By that logic, he says, it’s 
Neptune’ turn; it’s Venus’s turn. “You've got to 
choose the mission that's giving you the best sci- 
ence value for the money,’ he says. “And Titan 
is the low-hanging fruit.” With its atmosphere, 
its hydrological cycle and even its potential for 
cryovolcanism, Titan would mobilize a much 
wider scientific community, he says. 

NASA would send a spacecraft into polar 
orbit for four years, where it would spend more 
time at Titan in its first three days than the Cas- 
sini mission to Saturn will in its entire mission. 
To keep costs down, the orbiter would carry 
six instruments — about half as many as Cas- 
sini. But they would be fine-tuned for Titan, 
says Lunine. A mass spectrometer, for instance, 
would be sensitive to molecular chains with 
hundreds of carbons — orders of magnitude 
more sensitive than Cassini’s spectrometer, 
which is limited to molecules with seven or 
eight carbons. 

After arrival, ESA would showcase its 
contributions. The first — a dropped lander 
akin to Cassini’s Huygens probe — would 
splash down into one of two giant methane- 
ethane lakes near Titan’s north pole. It would 
float for a few hours in the complete darkness 
of winter and suck up samples through straws, 
sifting for the molecular evidence of organized 
organic chemistry that, even if pre-biotic, so 
excites astrobiologists. “You go to the lake,” 
says Lunine, “because the lake is nature's great 
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Tiny springboards detect 
viruses in fluids. 


..but a mission to Titan could feature a hot-air balloon. 


organic solvent medium for delivering these 
components to you.” 

But the piéce de résistance is the 11-metre- 
wide hot-air balloon, which, drifting along 
10 kilometres high in gentle winds near the 
equator, would image the surface through the 
methane mist. Heated by lumps of radioactive 
elements, the balloon would circle the moon 
at least once in a lifetime of a minimum of six 


months, says Athena Coustenis of the Paris 
Observatory, the lead European study scien- 
tist for the Titan mission. The heat shield used 
during the balloon’s initial descent would not 
be wasted: there is a proposal to have it stick 
gently in the soil as a ‘geosaucer, with a seis- 
mometer and magnetometer to measure tidal 
flexure and cryovolcanic rumblings. The 
balloon is likely to be built in France where, 
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says Coustenis, “balloons are a big deal”. 

In the end, the decision between Titan and 
Europa could come down not just to scientific 
promise versus technological readiness, but 
also to agency politics. The decision-makers, 
Edward Weiler and David Southwood, science 
chiefs for NASA and ESA respectively, haven't 
been shy in expressing personal opinions. 
“Tve seen the data coming out of Cassini and 
it knocks your socks off?’ says Weiler. “I just 
have a preference for Titan.” And Southwood 
says, “I have an emotional preference for Sat- 
urn and Titan. But I probably have a scientific 
longing for Jupiter” 


Early decision 
After reviewing the mission studies and reports 
from the competing teams, Weiler and South- 
wood will hold a teleconference by the end of 
the month in the hope of making a decision 
before an ESA advisory committee meeting on 
3 February. Weiler says 
technological readi- 
ness will probably be a 
deciding factor. If both 
missions are deemed 
technologically sound, 
Weiler says he may ask a panel convened by 
the US National Academies for advice on the 
most scientifically worthy moon. 

At ESA, the chosen mission will still have to 
compete against two other missions vying for 
one €650-million slot within the agency's Cos- 
mic Vision programme, a competition not due 
for final selection until 2011. But Southwood 
says that ESA needs to decide now whether it 
wants to pursue Titan, or Ganymede during 
a NASA mission to Europa. That’s because 
the technologies and approaches needed for 
either moon are so disparate. Titan would 
mean developing a lander and balloon linked 
tightly to the US orbiter. Ganymede would 
mean developing an orbiter to be launched 
separately from the US Europa probe. And 
should Europe decide, come 2011, that some 
entirely different mission should get the Cos- 
mic Vision slot, then NASA would be left to 
either pick up some of the European ideas for 
its own Titan or Europa mission, or to scale 
back its own ambitions. 

Although proposal scientists are nervously 
anticipating the decision, they will have a lot 
longer to wait for the mission itself. A probe 
launched for Jupiter in 2020 wouldnt arrive in 
Europa orbit until 2028, and a balloon bound 
for Titan wouldn't deploy until 2030. Lunine 
would be 71 years old — not necessarily beyond 
the workplace and, he hopes, spry enough for a 
little ballooning. 


See Editorial, page 358. 
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A fly by any other name 


Drosophila experts argue over reclassification proposal. 


The Byzantine world of species taxonomy is 
facing a new test: a proposal that would involve 
renaming Drosophila flies, arguably the lead- 
ing model genetic organism. 

The idea is to bring a new taxonomic order 
to the more than 2,000 species now grouped 
in the genus Drosophila. An application pend- 
ing before the International Commission on 
Zoological Nomenclature (ICZN) would des- 
ignate Drosophila melanogaster — the primary 
species used for genetics studies — as the type 
species for the genus. Currently, the type spe- 
cies is D. funebris, which was described in 1787 
by Johann Fabricius. 

Kim van der Linde, a postdoctoral researcher 
at Florida State University in Tallahassee, and 
colleagues filed the application in December 
2007. Recent genetic work has shown that 
D. melanogaster could be classified as Sopho- 
phora and could therefore be renamed Sopho- 
phora melanogaster. Van der Linde says her 
proposal is designed to preserve the name 
D. melanogaster. 

Launching the idea to update the nomencla- 
ture has let the fly-naming conundrum out of 
the bottle, touching off what ICZN staff call an 
unprecedented debate. Some worry that teach- 
ing, citation searches, publications and databases 
such as FlyBase would face a logistical nightmare 
if D. melanogaster changed its name. 

“If this were some obscure beetle, you could 
rename it Godzilla and it wouldn't make much 
difference,’ says Therese Markow, a geneticist 
at the University of California at San Diego who 
directs the Drosophila Species Stock Center. 
“But this is the premier genetic 
model system.” In November 
2008, Markow convened a 
workshop in San Diego where 
many of the world’s leading 
Drosophila authorities wrestled 
with van der Linde’s proposal, 
without resolve. 

Markow supports the status quo. So does 
Thomas Kaufman, a Drosophila geneticist at 
Indiana University in Bloomington and co- 
leader of FlyBase. “We should leave the naming 
alone — everything is working fine,” he says. 
“We can't even get agreement on names for the 
mutants, let alone the whole genus.” A number 
of formal comments on the ICZN petition were 
strongly opposed to it. 

Still, some taxonomists say it’s time for reor- 
dering, especially given the recent sequencing of 
many Drosophila species, such as the 2007 publi- 
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“If this were some 
obscure beetle, you 
could rename it 
Godzilla and it wouldn't 
make much difference.” 


Name game: Should the Drosophila melanogaster fly be the type species for the genus? 


cation of a dozen genomes that show how closely 
related species are to each other on a genetic 
level”. Van der Linde, a community ecologist 
trained in the Netherlands, got started on her 
proposal after doing some fieldwork in the Phil- 
ippines. She joined up with several colleagues, 
including the Drosophila taxonomist Masanori 
Toda of Hokkaido University in Japan, and nam- 
ing authority Gerhard Bachli of the Zoological 
Museum in Zurich, Switzerland. 

The group developed a 
petition (see http://tinyurl. 
com/999mep) for the ICZN, 
and wrote a manuscript pre- 
scribing their interpretation. In 
renaming, a scientist can seek 
the approval of the commis- 
sion, which then publishes the 
decision of its panel of about 25 authorities. Or 
a scientist can publish an article proposing the 
new order, leaving it to scientific communities 
to adjust names. “People can follow or reject us,” 
says van der Linde. 

A decision from the commission could take 
another year or more. Ifit rejects the proposal, 
taxonomists could take it on themselves to 
split the genera in descriptions in future pub- 
lications. Under this scenario, D. melanogaster 
could become S. melanogaster, and 1,000 other 
species could be renamed too. 
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Earlier this month, van der Linde said she 
received commission feedback suggesting the 
petition may be set aside. That would leave 
the community to split the genus at will. This 
approach is supported by the international edi- 
torial team of the BioSystematic Database of 
World Diptera, a US-based fly name resource. 
In acomment on van der Linde’s petition, they 
note that today’s computer search engines “will 
have no trouble finding information” if D. mel- 
anogaster were to become S. melanogaster. 

Patrick O'Grady, a geneticist at the Univer- 
sity of California at Berkeley who does not 
support the van der Linde proposal, disagrees. 
“There would be chaos in the literature if 
D. melanogaster changes to S. melanogaster,’ he 
says. “Genetic work could be lost. It would be 
hard to find things.” As an example, he points 
to the 2004 renaming, through publication’, 
of the mosquito that spreads yellow fever and 
dengue: Aedes aegypti to Stegomyia aegypti. 
Some researchers cite the new name; others 
ignore it and use the old name’. a 
Rex Dalton 
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No bull: genes for better milk 


On 13 January, the US Department 
of Agriculture (USDA) launched a 
service that allows dairy-cattle breeders 
to double their chances of selecting the 
best bulls to sire milk-producing cows. 
“This is the future of animal breeding,” 
says Juergen Richt, a veterinary surgeon 
at Kansas State University in Manhattan. 
For a decade, breeders who want to 
locate the best bull have the animals’ 
semen tested for its DNA, looking for 
traits linked to milk quality and pro- 
duction. About a year ago, the leading 


DNA testing can predict the best milk producers. 


about $225, and can be done when a bull 
is born, thus avoiding the $25,000-50,000 
cost of raising a bull for five years to see if 
it sires good milk-producing offspring. 
“The best bulls become elite breeders,” 
says Van Tassell, “The others become 
hamburger” 

Previously, DNA tests allowed a typi- 
cal breeder to select the best bull some 
35% of the time, says geneticist Ole Meland, 
vice-president of Accelerated Genetics 
in Baraboo, Wisconsin. The new 
technique identifies the best bull 70% 


artificial-insemination organizations 

in the United States and Canada funded a 
US$1-million research project directed by 
Curtis Van Tassell, a geneticist at the USDA's 
Bovine Functional Genomics Laboratory in 
Beltsville, Maryland. Working with Illumina 
Inc. of San Diego, California, Van Tassell’s 
team created a microarray chip containing 
54,000 genetic markers called single nucleotide 


polymorphisms, or SNPs, that involve at least 
a dozen traits, including those known to affect 
milk quality and production. 

Using high-throughput analysis, the 
researchers could then compare the DNA from 
ayoung dairy bull against the chip SNPs, telling 
breeders which bull would be likely to sire calves 
that were good milk producers. The test costs 


of the time. 

The US initiative is the first such nationwide 
programme. Companies in New Zealand and 
the Netherlands have set up private services for 
cattle breeders; and, following the USDAs lead, 
similar systems are being built by researchers at 
Aarhus University in Denmark, and in France 
and Australia. a 
Rex Dalton 
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Plan to grow more cannabis 
for research turned down 


US researchers do not need a second 
federally approved facility for providing 
research-grade cannabis, the Drug 
Enforcement Administration (DEA) has 
ruled. The decision ends an eight-year 
bid by Lyle Craker at the University of 
Massachusetts in Amherst, to grow cannabis 
for medical research, which was supported 
by the Multidisciplinary Association for 
Psychedelic Studies (MAPS). A single 
laboratory at the University of Mississippi 
supplies cannabis under a contract with the 
National Institute of Drug Addiction. 
Craker, MAPS and others have argued 
that the current supply of cannabis is of 
inconsistent quality, difficult to obtain, and 
that efforts to widen that supply are being 
stalled for political reasons (see Nature 
430, 492; 2004). In 2007, a DEA judge 
recommended that Craker’s application to 
set up a second facility should be granted; but 
the DEA’ final ruling, made on 7 January, 
said the current supply was adequate. 


FDA to regulate the use of 
transgenic animals 


The US Food and Drug Administration 
(FDA) has adopted guidelines for regulating 
genetically engineered animals. 

Released on 15 January, the rules have 
been more than a decade in the making. 
They follow a set of draft guidelines, released 
in September 2008, which drew fire for 
effectively treating genetically engineered 
animals as drugs (see Nature 456, 2; 2008). 

In response to other concerns about 
the transparency of the approval process, 
the new guidelines note that the agency 
intends to hold public advisory committee 
meetings before approving any genetically 
engineered animals. 

Earlier this month, an FDA advisory 
committee deemed an anti-clotting drug 
called ATryn, produced in the milk of 
genetically engineered goats, to be safe — 
an important step towards the eventual 


Goats: drug factories of the future? 


Europe set to clamp down on pesticide use 


After three years of wrangling, 
the European Union (EU) 

is expected next month to 
finally approve controversial 
legislation to regulate 
pesticide use. Early drafts of 
the law drew protest from 
manufacturers, farmers and 
scientists, who claimed that 

a drastic reduction in the 
number of available pesticides 
would lower crop yields and 
raise food prices. 
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The compromise ruling backed last week by the European Parliament will result in just 23 of the 
roughly 500 marketed pesticides being banned, according to the Swedish Chemicals Agency. 

The directive also promotes the use of non-chemical pest-control methods, bans aerial crop 
spraying without authorization and curbs pesticide use in areas such as parks and playgrounds. 
Once approved, the ruling must be incorporated into the national laws of all member states by 2011. 
For a longer version of this story, see: http://tinyurl.com/7d6zvt. 


approval of the drug for sale in US markets. 
For a longer version of this story, see http://tinyurl. 
com/8kv5dr. 


Polio eradication battle is 
bolstered by $630 million 


A final push to overcome the few remaining 
pockets of polio infection in the world 

is being supported by grants totalling 
US$630 million from the Bill & Melinda 
Gates Foundation, the humanitarian group 
Rotary International, and the UK and 
German governments. 

The funding, announced on 21 January, 
will support the World Health Organization’s 
Global Polio Eradication Initiative, which 
involves vaccinating billions of children. 
Since its launch in 1988, the effort has cut 
the annual polio toll from 350,000 cases in 
125 countries to just 1,600 cases last year. If 
the crippling disease could be eradicated, 
it would become the second disease, after 
smallpox, to be officially wiped out. 

Reservoirs of polio now remain in four 
countries where the disease is endemic: 
Nigeria, where polio vaccines were opposed 
following false rumours that they carried 
HIV and caused female infertility; India, 
where vaccine effectiveness has been 
hampered by poor sanitation and high 
population density; and conflict zones in 
Afghanistan and Pakistan. 

For a longer version of this story, see http://tinyurl. 
com/7e5s4u. 


Democrats hasten action 
on climate legislation 


Democrats in the US House of 
Representatives have announced plans 

to draw up climate legislation this spring, 
arguing that business leaders need regulatory 
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certainty to drive a green economic recovery. 
Incoming Energy and Commerce 
Chairman Henry Waxman, who held his 
first climate hearing on 15 January, said that 
the committee would deliver the legislation 
by the end of May, a decision backed by 
House Speaker Nancy Pelosi. Although 
President Obama favours a domestic cap- 
and-trade programme, many energy and 
climate analysts expected that the legislation 
would be delayed as lawmakers focus on 
the economy and a less controversial bill to 
advance clean energy. 
The US Climate Action Partnership, 
a coalition of major industrial firms and 
environmental organizations, bolstered 
Waxmans case by calling for legislation 
that would curb carbon dioxide emissions 
by 42% by 2030 and 80% by mid-century 
compared with 2005 levels. 


Billion-dollar neutron 
facility gets thumbs up 


The US Department of Energy has approved 
a US$1-billion upgrade to the Spallation 
Neutron Source (SNS) at Oak Ridge 
National Laboratory in Tennessee. 

The SNS generates neutrons by firing a 
high-energy beam of protons at a mercury 
target. The neutrons are used to probe 
the structures of materials ranging from 
proteins to superconducting ceramics. The 
existing SNS facility, completed in 2006, 
currently feeds neutrons to 10 instruments 
used by about 1,000 scientists and engineers 
last year. The addition ofa second target will 
allow for up to 24 more instruments, fed by 
long pulses of slower-moving cold neutrons, 
which can be used to study larger molecules 
such as polymers. 

Conceptual design work on the upgrade 
can now begin, with construction expected 
to finish no earlier than 2020. 
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Beijing's windy bet 


After spurning wind power, China has swung around and 
embraced this clean energy. But the nation’s love affair with wind 
may be spinning out of control, finds David Cyranoski. 


s he drives along a two-lane highway 

skirting the Gobi Desert, Hubert 

Beaumont sees nothing but wind. 

The road in this part of China's Gansu 
province runs completely flat — a monotonous 
route past grey rocks, a few shrubs and some 
100 sleek turbines towering over the empty ter- 
rain. “I can't think of anything else youd want to 
do here except wind energy,’ says Beaumont, a 
Beijing-based wind specialist at Ecofys, a green- 
energy consulting firm. 

Indeed, the region is pegged to become the 
core of a ‘mega wind-power base; a massive 
collection of wind farms with 5 gigawatts 
of capacity by 2010. Even considering that 
wind turbines generally produce 
only about a quarter of their 
advertised capacity, these 
would still generate enough 
juice to satisfy about a million 
energy-guzzling US homes. It 
will be the biggest wind-power 
development in the world. 

For a country with such a bad 
environmental reputation, China 
is fast amassing green credentials 
in wind. Wind-energy generation 
capacity has nearly doubled in each 
of the past three years, and in 2007 the 
country surpassed its goal to achieve 
5 gigawatts by 2010 (see graph). With 
plans to build four more mega-bases like 
the Gansu project, China is poised within the 
next decade to blow past four other nations, 
including Germany and the United States, the 
current number one and two in wind-energy 
capacity. 

China is also manufacturing and, increas- 
ingly, designing turbines. Domestic turbine 
manufacturers numbered just four in 2004. 
Now there are some 70. Within three years, 
says Haiyan Qin, secretary-general of the 
Chinese Wind Energy Association in Beijing, 
China will manufacture more turbines than 
other country. 

But experts wonder whether the superlatives 
applied to Chinese wind farms will also include 
‘least reliable’ and ‘most inefficient. 

Wind-turbine manufacturers and wind- 
farm developers everywhere have faced teeth- 
ing problems, but China has perhaps faced 
more difficulties than most. Its wind farms are 
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much less efficient than those in other leading 
countries, manufacturing defects have plagued 
Chinese equipment and the nation’s electri- 
cal grid cannot carry all the wind power the 
country is generating today, let alone the huge 
amounts planned for the next few years. Some 
critics, including several from international 
turbine companies, blame a lack of planning 
and poor Chinese manufacturing. 

Chinese engineers, entrepreneurs and 
government officials are working to improve 
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the situation but they have along way to go. 
As Beaumont passes the Gansu farm, he sees 
both the promise and the perils of China’s surge 


in wind power. A strong breeze blows across the 
desert but half of the turbines are standing still. 


Storm chasers 
Although China is currently chasing wind 
power, its leaders had little affection for it just 
a few years ago. Over the 1980s and 1990s, 
efforts by the World Bank, the Asian Develop- 
ment Bank and others to get energy from wind 
in China fizzled out. A goal to have 1 gigawatt 
of capacity by 2000 passed without notice, or 
achievement. 

“Before 2004, leaders thought wind was too 
small. They didn’t think it was real energy,’ says 
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Qin. But the sizzling 
economy boosted 
demand, energy prices 

soared and, in 2004, 24 
of China's 27 provinces 
were hit by blackouts. 

At around the same 

time, environmental 

pressures set in ahead 
of the 2008 Olym- 
pics. The government 
responded in Febru- 
ary 2005 with a Renew- 
able Energy Law and 
subsequent guidelines 
that called for all major 
power companies to create 
a percentage of their energy from renew- 
able sources other than hydropower: 3% by 
2010 and 8% by 2020. With biomass resources 
too sparse and photovoltaics too expensive 
(although China is the biggest producer in the 
world), most of that renewable energy will be 
achieved through wind. 

Beginning in 2004, farms started to open up 
in the windy northern and eastern perimeter of 
the country (see map). Total installed capacity 
climbed steeply: 1.3 gigawatts in 2005, 2.6 giga- 
watts in 2006, 5.9 gigawatts in 2007 and, accord- 
ing to early estimates, 10.6 gigawatts in 2008. 

But there is a hitch — China’s wind farms 
are underperforming. All power installations 
have a ‘capacity factor, which is calculated by 
dividing the energy actually produced by what 
the installations could maximally generate. 
According to data from the Beijing branch of 
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To meet its renewable-energy targets, China will need to get 5% of its power from wind farms by 2020. 


London's New Energy Finance, a consultancy 
firm that advises investors on developments in 
renewable energy, on-shore turbines in other 
leading wind power countries have capacity 
factors of around 30%. China's is just 23%. 

“China’s numbers are not good,’ says the 
firm's Justin Wu. “It might not seem significant, 
but a few percentage points could make the dif- 
ference between a farm that is economically 
viable and one that is not.” 

Wind experts blame several factors, starting 
with the turbines themselves. When wind-farm 
developers began gearing up around 2004, they 
imported turbines from established overseas 
manufacturers. Recently they have relied more 
on domestic makers. In 2005, Beijing added a 
requirement that 70% of turbine parts be made 
by domestic manufacturers. Major interna- 
tional turbine makers have established manu- 
facturing plants in China, but they are losing 
out to local firms. According to data collected 
by the China Wind Energy Association, 2008 
will be the first time that the installed capac- 
ity of Chinese-made turbines will exceed that 
of foreign ones. The 3.8 gigawatts already 
assigned to developers for the Gansu project, 
for example, does not include a single turbine 
from a foreign maker. 

That’s great for China's wind-turbine late- 
comers, but is it good for the wind farms? 
According to Wu, Chinese wind farms using 
foreign models have a 5% higher overall capac- 
ity factor than those using domestic turbines. 
Domestic turbines are especially unproductive 
when first set up, says Wu. 


Because the technology is newer and less 
tested, Chinese turbines are also more likely 
to be shut down for maintenance, according 
to anecdotal evidence. A turbine’s average 
down time is a closely guarded trade secret. 
But Xiliang Zhang, director of Tsinghua Uni- 
versity’s Institute of Energy, Environment and 
Economy in Beijing, says he hears reports that 
domestic turbines are standing still while for- 
eign models nearby are 
humming away. 

Domestic models are 
15-20% cheaper, and the 
quality has been rising 
quickly as they have either 
used proven foreign tech- 
nology or have hired for- 
eign engineers to help with designs. But as the 
lifetime of a turbine averages 20 years, the more- 
productive foreign models would be better buys 
overall, says Wu. (Representatives of two major 
Chinese turbine manufacturers did not return 
phone calls or e-mails requesting comments.) 

So why do wind-farm developers in China 
mostly choose domestic turbines? It is a con- 
troversial subject. Despite the mandate to 
source 70% of parts from domestic manufac- 
turers, turbines used in major national projects, 
including the recently started wind megabases, 
are supposed to be purchased through open 
bidding. But the bidding process is a tricky 
business and many in the industry assume an 
opaque policy of favouritism, especially con- 
sidering that most of the wind-farm developers 
and domestic turbine-makers are state-owned 
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“The Chinese government 
cannot be taken as a model 


of transparency.” 
— Paulo Fernando Soares 
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enterprises. “The Chinese government cannot 
be taken as a model of transparency,’ says Paulo 
Fernando Soares, chief executive of Suzlon 
Energy in Beijing, a subsidiary of a major tur- 
bine manufacturer in India. 


Spin tactics 

Turbine quality is one problem; finding the best 
turbine for the available wind is another. Every 
turbine has a ‘load envelope that defines roughly 
what wind speed, turbulence, wind shear and 
other conditions it will function best in. Some 
turbines work better for wind that comes in 
short bursts whereas others work better with 
long consistent spells of low wind. Poorly cho- 
sen turbines will be more likely to break down. 
A turbine with a wide blade can catch low winds 
but could be destroyed by typhoons. If turbines 
are poorly chosen for a site, their efficiency will 
plummet. “Decisions made in the first year 
will affect the 20-year life of the turbine,” says 
Sebastian Meyer, who researches wind-energy 
resources in China for Ecofys. “Serious compa- 
nies won't put turbines in sites for which they 
are not suitable,” says Soares. But until now, he 
says, Chinese developers have too often relied 
on inadequate wind data. 

China is now trying to address this problem. 
By June, it will finish installing 400 masts to 
measure wind-energy resources in various 
regions throughout China. The masts will 
stand 70 metres, 100 metres or 120 metres 
tall, to match the height of the turbines. China 
plans to spend more than 200 million renminbi 
(US$30 million) on the four-year project, 
which started in 2007. 
“This project is the key to 
make the national plan of 
wind-power development 
work,’ says Zhenbin Yang, 
deputy director of the 
wind-resource laboratory 
at the Chinese Academy of 
Meteorological Sciences in Beijing. Yang will 
analyse the data provided by the masts. 

The plan has its sceptics. Because wind can 
vary greatly over short distances, even this 
survey of broad areas of wind flow will be 
inadequate and ‘micrositing’ data will still be 
necessary from the specific regions where the 
turbines are planned, says Soares. 

More scepticism is aimed at a proposed 
secondary use of the data — to create new 
standards for turbine design specifically based 
on Chinese weather. “Wind turbines are not 
like refrigerators sitting in a house,’ says Zhang, 
“China is very different from the United States 
and Europe.” But although it is still not clear 
what form the Chinese standards might take, 
some question whether China really needs its 
own standards. “It’s no colder in Mongolia than 
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it is in Minnesota,” says Soares. 

Given China’ plans to make five huge mega- 
bases, the number of turbines in its future is 
staggering. By 2015, the Gansu project alone 
will boast 10,000 turbines with a combined 
capacity of 12 gigawatts. China's four other 
megabases — in Xinjiang, Inner Mongolia, 
Hebei and the Shanghai-Jiangsu region — will 
total 60 gigawatts. 

But to make use of all that energy, China’s 
wind hopefuls must tackle an even more 
intractable problem — the electricity grid. 
China's energy supply has been hamstrung by 
a fragmented and underdeveloped grid system 
that makes it difficult to get energy from coal- 
rich rural areas to the cities on the east coast. 
Wind has the same problem: it is produced 
mostly in sparsely populated regions that can- 
not use all that energy. The problem is com- 
pounded by the unreliability of a power source 
that is based on the weather. “Grid companies 
don't like wind. It changes too fast,” says Qin. 

Steve Sawyer, secretary-general of the Brus- 
sels-based Global Wind Energy 
Council, says that other coun- 
tries, particularly the United 
States, have struggled with 
their grids. This month, in fact, 
Democrats introduced a bill in 
the House of Representatives 
that allocates US $11 billion to 
modernize the grid. 


Hurry up and wait 

China’s rapid expansion has caused delays down 
the line. Turbines often have to sit idle — on 
average for four months — before they get 
hooked up to the grid. The backlog is huge. Of 
the 5.7 gigawatts of turbine capacity installed by 
the end of 2007, only 4 gigawatts was plugged 
into the grid. 

Getting connected is just the first hurdle. 
Many grids are simply too puny to carry all 
the electricity being made. At peak production 
times, turbines often have to shut down so 
as not to overload the electrical networks. 
Newer turbines can alter the angle of their 
blades to miss the wind, and will slow to a 
halt. The only loss is the energy. For older 
turbines, the operator has to slam on the 
brakes. Turbines in China wear through their 
brakes at remarkable speeds, says Meyer. 
Elsewhere, the brakes are usually used “once 
a month or less, but in China they’re using it 
every four days’, he says. 

The delays and losses could drive people 
away from wind power, says Qin. “If there is 
not a serious plan for the grid soon, we will 
not be able to develop wind energy beyond 
the next few years,’ he warns. 
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"Grid companies 
don't like wind. It 


changes too fast.” 
— Haiyan Qin 


Gridlocked: Problems with the 
electricity grid mean that wind 
turbines are often at a standstill. 


So why hasn’t the Chinese government's 
legendary ability to get things done kicked in. 
According to a December 2008 report by New 
Energy Finance, China’s National Develop- 
ment and Reform Commission — the body that 
oversees national economic 
and social development — is 
concerned about the “lack of 
supervision in China’s rapid 
wind-power growth’, especially 
when it comes to the grid. 

But the government is afraid 
to do anything that would raise 
prices, says Zhang. The problem is low demand 
for the expensive energy. “The solution is to 
force places, like Beijing, to buy it,” says Zhang. 
But the government hasn't brought itself to do 
that. It’s easier to let them stick with coal. 

Help might be on the way. The megabases 
have an economy of scale that will make them 
more attractive targets for grid developers. And 
in November last year, the country’s biggest 
power supplier, the State Grid Corporation of 
China, which provides some 88% of the nation’s 
power, announced plans to more than double its 
investment in grid infrastructure for 2009 and 
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2010 from 550 billion renminbi to 1.16 trillion 
renminbi. Regional projects will also help. In 
November this year, Gansu will put 20 billion 
renminbi into its grid to support its megabase, 
and Inner Mongolia, also set to get a megabase, 
plans to add 30 billion renminbi to its grid oper- 
ations by 2010. These will feed into a scheduled 
upgrade from 110 kilovolts to 750 kilovolts for 
a transmission line running from Xinjiang in 
the northwest, through Gansu, to the eastern 
metropolises. “It gives a feeling of optimism,” 
says Beaumont. 

Of course, hooking up wind turbines might 
not be the main thing on grid developers minds. 
Wind energy is currently only a drop in the 
bucket in China’s 793-gigawatt energy supply. 
In 2008, wind accounted for just 0.3% of China’s 
total electricity production, most of which was 
powered by coal. 

But if the five megabases go as planned, 
their combined capacity would provide more 
electricity than the controversial Three Gorges 
dam, even considering the dam's much higher 
capacity factor. Meeting government targets 
for renewable energy for 2020 will, accord- 
ing to most estimates, require a prodigious 

100 gigawatts of wind energy — about 5% 
of China's total energy supply. Some esti- 
mates note a potential of 500-600 gigawatts 
by 2040 to 2050. 

For now, the Chinese government 
has thrown its considerable weight into 
exploiting this resource. “It is its flagship 
renewable-energy industry, so the govern- 
ment is going to support it,” says Wu. But 
unless some dramatic changes are made 
soon, China’ plans for wind energy might 
get blown far off course. o 
David Cyranoski is Nature's Asia-Pacific 
correspondent. 


See Editorial, page 357. 
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Crisis communication 


Messages appear on Internet-based social networks within minutes of disasters occurring. 
Lea Winerman investigates how to harness this trend to create official community-response grids. 


he messages began to fly almost as 
soon as the bullets stopped. Starting at 
7:00 on the morning of 16 April 2007, 
an undergraduate named Seung-Hui 
Cho had carried a pair of semi-automatic 
pistols through the campus of the Virginia 
Polytechnic Institute and State University in 
Blacksburg — better known as Virginia Tech 
— gunning down dozens of students and pro- 
fessors as he went. By 9:51 it was over: Cho 
had turned one of his guns on himself. But the 
survivors, lacking any official word from the 
university other than the total death toll, were 
still in the dark about which of 
their friends had lived or died. 
So they turned to the best 
information source they had: 
the Internet — notably, the 
social website Facebook. 
Posts appeared in quick 
succession, indicating the 
names of suspected casualties. 
Those here have been edited for privacy rea- 
sons. “CH, as reported bya sorority sister,’ read 
a post on one Facebook page. “I just finished 
speaking with his girlfriend, and it appears JH 
is a fatality as well,’ read another post. As the 
information accumulated, the participants 
spontaneously began to develop their own 
norms to ensure accuracy. Anonymous or vague 
posters were asked for clarification. People had 
to identify themselves when they put forward 
the name ofa shooting victim, for example, and 
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explain where they had got the information. 

By the time the university released the names 
one day later, it was old news to the online 
community: they had identified all 32 of the 
deceased already. 

The Virginia Tech story is hardly unique. 
“When people are under threat, perceived or 
actual, they go into this intensified information- 
seeking period,’ explains Leysia Palen, a com- 
puter scientist at the University of Colorado, 
Boulder. And these days, they are increasingly 
doing so through social networking sites. 

But social-network users often end up 
bypassing the authorities — a 
tendency that has left officials 
scrambling to use this informa- 
tion and integrate it into tradi- 
tional responses. 

“Emergency managers have 
this desire to control the flow 
of information,” says Jeannette 
Sutton, a sociologist who also 

studies disaster communication at the Uni- 
versity of Colorado. “But you cant control it. 
The best we'll be able to do is figure out how 
to harness it.” 

Palen calls her research ‘crisis informatics, 
and it has taken her to several disaster hotspots. 
In the Virginia Tech case, for example, she and 
her colleagues began monitoring websites 
within hours of the shooting, then travelled to 
Blacksburg five days later to interview people. 
Back in Colorado, they created a detailed 
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timeline of official communications (univer- 
sity e-mails, press conferences) and unofficial 
communications (Facebook messages, Flickr 
photos) that stretched across a giant white- 
paper diagram on their office wall. 

This allowed them to track the emergence 
of those informal norms. “One of the concerns 
from the emergency-management point of 
view is ‘how can we know the information 
[posted on websites] is accurate?” Palen says. 
“We saw that here it was self-correcting” 


Spreading like wildfire 

In October and November 2007, Palen and 
her colleagues examined the online response 
to another crisis. During that time, more than 
20 wildfires raged from Santa Barbara to San 
Diego counties in southern California, even- 
tually burning 200,000 hectares, destroying 
about 1,500 homes and forcing many more 
households to evacuate. 

Palen and her colleagues monitored local 
news websites and online forums including 
Craigslist, Facebook, Twitter and Flickr. By 
the tenth day of the fires, they also began to 
survey and interview area residents. 

“National news websites were completely 
worthless as they ignored everything except 
the comparatively minor Malibu fire that 
burned near some celebrity homes,’ one 
evacuee wrote. 

And official government communications, 
although sometimes useful, couldn't be relied 
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on either. “The county so-called emergency 
site was always crashed,’ another wrote. 

Instead, many people turned to websites 
run by local media, such as the National Pub- 
lic Radio station KPBS, based in San Diego, 
or by individuals. On these sites, the updates 
came from any local resident with an Internet 
connection and information to share. Some 
hosted Google maps on which users could 
overlay information such as the location of 
the fires. Others hosted discussion boards on 
which people who hadn't evacuated, or who 
had made it back to their homes, could share 
what they were seeing. 


Worldwide webs 

Online information-sharing during crises isn’t 
limited to the United States, and other media 
agencies are already using the concept. In early 
January, for example, Al Jazeera, a satellite tele- 
vision network headquartered in Doha, Qatar, 
launched an experimental website that aggre- 
gates text messages, mobile-phone reports 
and Twitter feeds about the conflict in the 
Gaza Strip on a Microsoft Virtual Earth map 
(http://labs.aljazeera.net/warongaza). 

And Yan Qu, Philip Wu and Xiaoqing Wang, 
researchers at the University of Maryland in 
College Park, observed information-sharing 
trends after the 12 May 2008 earthquake in 
China's Sichuan province. 

“The earthquake happened at 14:28. Within 
one minute there was a message posted on a 
Tianya forum, says Wu. Tianya is one of 
China’s most popular websites — a bulletin- 
board system with more than 20 million 
registered users. Within 10 minutes, 56 discus- 
sion threads reported feeling the earthquake in 
22 cities throughout the region. 

The researchers read and classified the 


thousands of threads that 
appeared on the site in 
the days following the 
quake. 

They found that Chi- 
nese citizens used the 
site for many of the same 
purposes as Californians did 


during the wildfires: seek- Ggogle Mme =—<——— 
ing information about their "ge 
homes, hometowns and family, 7am. Mies. « 

and coordinating action to help TE Tesi sane 


victims of the crisis. And there 
were some widely reported fan 
success stories. In one, a college 


student saw news reports about aa 
the military having a difficult — @ 7AM es 


Amr Kenn es 
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time finding a spot to land a res- 
> Tapes 


cue helicopter near a destroyed 
village in the mountains. The 
woman, who had grown up 
in the area, posted a detailed 
description of a potential land- 
ing spot in an online forum, and begged users 
to forward it to authorities. The post eventually 
found its way to the military, who landed a heli- 
copter in the spot she described. 

Unfortunately, says Wu, that kind of story 
is rare. In general, professional emergency 
responders are only vaguely aware of how citi- 
zens use social media during disasters. “Even 
in the United States, professional emergency 
responders are just beginning to think about 
this. In China I don’t think there’s any kind of 
explicit effort,’ he says. 

It’s the lack of official involvement that Ben 
Shneiderman, a computer-science professor at 
the University of Maryland and his wife Jen- 
nifer Preece, dean of the university's College 
of Information Studies, are trying to change. 
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Researchers tracked social norms in the communications following the Virginia Tech shootings in 2007. 
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Community response grids (top) integrate official 
responses with information from the public, such 
as this Google map created by Chinese citizens 
during the Sichuan quake in 2008. 


In 2007, they published a one-page article in 
Science called 911.gov (B. Shneiderman and 
J. Preece Science 315, 944; 2007). Taking their 
title from the emergency telephone number 
for much of North America, they envisioned 
a web-based ‘community response grid’ that 
would combine the power of social network- 
ing sites with official government emergency- 
response systems. 

An emergency telephone system works ter- 
rifically under normal circumstances, says 
Shneiderman. “But when you get a Hurricane 
Katrina or a 9/11, [the call centres] become 
overwhelmed.” Potentially, at least, a website 
can handle such sudden surges more gracefully 
by tapping more servers as needed. 

As Shneiderman sees it, people would report 
incidents via the Internet or by sending text 
messages from their mobile phone rather 
than by calling 911. Software would aggregate 
those reports into a constantly updated map 
of the situation, which citizens and emergency 
responders could check without encountering 
clogged phone lines. People could also register 
to receive block-by-block information — again 
via e-mail or text message — about whether to 
stay in place, to evacuate or to respond in some 
other way. And people would be able to coor- 
dinate with their neighbours before, during 
and after emergencies on community message 
boards. So, for example, a family could agree 
to take responsibility for a wheelchair-bound 
neighbour during an evacuation. 
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Mobile-phone photos of the Sichuan earthquake in China aided the rescue efforts. 


Shneiderman is collaborating with Wu, 
Qu and others to explore how a small-scale 
version of the system would work on the 
University of Maryland campus. Wu says 
that the official emergency responders in the 
university Department of Public Safety are 
very interested in the idea — but they are also 
wary. They are concerned that they wouldn't 
have the staff to run the system, he says, and 
that it might confuse people who are already 
well-trained to call 911 in an emergency. Then 
there’s the possibility that people could use 
the system to spread bogus information and 
rumours. “If you have everybody able to do 
peer-to-peer, then there is widespread panic,’ 
one safety officer told the researchers. 

That reaction — interest tinged with 
scepticism — mirrors the reaction of many 
professional emergency responders. 

Federal and local disaster-response 
agencies have long operated under the Incident 
Command System, a standardized protocol 
developed in the 1970s by firefighters battling 
California wildfires, and later adopted by fed- 
eral agencies including the Federal Emergency 
Management Agency (FEMA). 

The system, with a clear, top-down chain 
of command, views communication with 
the public as a one-way street: information is 
supposed to flow from officials to the public 
via warnings sent out over 
TV, radio and other media. 
That view fits well with older 
academic research in disaster 
sociology. 

“Thirty years ago no one 
talked about crisis com- 
munication,’ says Dennis 
Wenger, a sociologist at the 
National Science Foundation who has studied 
disasters for more than three decades. “Back 
then the term was warnings research.” 

The questions that preoccupied research- 
ers then were how to write an effective 
warning — instructing people to evacuate 
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“How do you convince 
people that they are 


at risk? Only through 
other people.” 
— Dennis Mileti 


before a hurricane, for example — and how 
best to make sure everyone heard it. 

“But then all of a sudden came the Internet 
revolution, and it blew apart this notion ofa 
linear chain,’ says sociologist Kathleen Tierney, 
director of the Natural Hazards Center at the 
University of Colorado. Social-networking and 
photo-sharing sites, mobile phones and text 
messages have turned the chain into a web. 

In hindsight, that was only to be expected. 
The chain was never as linear as the models 
made it out to be. One of the first to realize that 
was Thomas Drabek, a disaster researcher at 
the University of Denver, Colorado, who sur- 
veyed evacuees after the South Platte River 
flooded near Denver in 1965. More than 60% 
of the people he spoke to told him that even 
after they received a warning telling them to 
evacuate, they tried to confirm it — checking 
with family and friends, talking things over, 
watching to see what their neighbours were 
doing — before taking action. 


Reaching an understanding 

Researchers have observed the same infor- 
mation-sharing inclination in many disasters 
since. After the bombing of the World Trade 
Center in New York in 1993, Benigno Aguirre, 
a sociologist at the Disaster Research Center of 
the University of Delaware in Newark, found 
that people who were in large 
offices took longer to evacu- 
ate than people who were in 
smaller groups, because it 
took them longer to come to 
a common understanding of 
what was happening. 

“Tt is actually very difficult 
to get human beings to per- 
ceive that they are at risk,’ says Dennis Mileti, 
a disaster-management researcher at the Uni- 
versity of Colorado. “How do you convince 
people that they are at risk? Only through other 
people.” And there, as long as the process isn’t 
too time-consuming, is one of the advantages 


© 2009 Macmillan Publishers Limited. All rights reserved 


of the Internet. “Long before technology, you 
could check in with neighbours next door,’ says 
Sutton. “But now you can check in with peers 
around the country.” 

Garry Briese — a regional administrator 
for FEMA in Denver and, for 22 years before 
that, the executive director of the International 
Association of Fire Chiefs — is excited about 
the opportunity that public feedback offers. He 
recounts the example of a wildfire in Deckers, 
Colorado, last summer, when he was able to 
find photos of the fire posted online by local 
residents within 30 minutes of hearing about 
the fire himself. 

“Tt expands our ability to have situational 
awareness, he says. “Someday, in a fast-moving 
wildfire, if we had the right system, we could 
ask people by a text alert to take pictures and 
send them to an emergency centre. And our 
ability to understand what the scene looks like 
would be enhanced.” 

That scenario is still a long way from reality, 
says Briese, who, in July 2008, talked to an offi- 
cial from a large city’s emergency operations 
centre about the information already available 
on the web. “A few days later, the guy called 
me and said ‘you know, my IT department has 
me blocked from those sites;” he says. “So here 
we have an emergency manager who wants to 
learn, and the first thing he has to do is go to 
the IT department and convince them.” 

Sutton encountered something similar when 
she spoke about her research at a conference of 
state-level emergency managers in September 
2008. “I had a couple of people say to me, “This 
is so new to us, we don't even know what to 
ask,” she says. 

In any case, answers might not have been 
easy to supply. “This information is out there, 
and you can find it online,” Sutton says. “But 
how do you aggregate it, and where would it 
get plugged into? Would it go back to an emer- 
gency operations centre, a joint information 
centre, a public information office?” 

When Shneiderman proposed community 
response grids in 2007, he thought that it might 
become a reality within 3-5 years. Now, with so 
much research still to do, he thinks 5-10 years 
might be more accurate. But he is used to wait- 
ing. He helped to develop the concept of hyper- 
text links in the 1980s and saw them become 
integral to the web more than a decade later. 

Not everyone has Internet access or the 
technical knowhow to take advantage of 
it. But, during a disaster, a community- 
response grid could benefit almost everyone, 
as family, friends, neighbours and authori- 
ties share what they know. “I think this is 
inevitable,” says Shneiderman. a 
Lea Winerman is science editor for The Online 
NewsHour with Jim Lehrer. 
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OPINION 


CORRESPONDENCE 


Scientists stand by 
decision to join 
Mbeki's AIDS panel 


SIR — Your Editorial ‘The cost of 
silence?’ (Nature 456, 545; 2008) 
questions our decision — as 
scientists who opposed dissident 
theories — to participate in the 
now-discredited AIDS advisory 
panel set up by former South 
African president Thabo Mbeki. 
We had no role in, and did not 
see or approve, the panel's report 
(Nature 410, 730; 2001). Each 
of us had agreed independently 
to join the panel, guided by our 
consciences as scientists in a 
young democracy and without 
prior knowledge of who else had 
been invited to participate. To 
us, it provided an opportunity to 
present Mbeki with the alternative 
viewpoint and the compelling 
scientific evidence that HIV 
causes AIDS, in the hope that 
rationality would prevail. 
However, Mbeki’s antipathy 
to antiretroviral drugs was 
influenced by documents from 
and interactions with AIDS 
dissidents that predated the 
setting up of the panel. We 
underestimated the strength of 
his dissident views on AIDS and 
how little impact sound science 
would eventually make on them. 
Sadly, our advice to Mbeki on 
AIDS causation and antiretroviral 
treatment was rejected. We 
cannot, therefore, be numbered 
among those held accountable 
for Mbeki's decisions, which led 
to the loss of many thousands of 
lives in South Africa through lack 
of access to antiretroviral therapy. 
That we failed to change 
Mbeki’s opinions on AIDS is a 
matter of record. But your Editorial 
is unreasonable in implying — 
with the benefit of hindsight 
— that scientists could have 
foreseen this failure and therefore 
should not have signed up to an 
opportunity to give the president 
critically important information 
that might have saved the lives of 
their fellow-countrymen. 
We stand by our decision to 
participate in the Mbeki panel. We 


have an obligation to our country, 
which is suffering the worst AIDS 
epidemic in the world, to do 
everything in our power to provide 
our political decision-makers with 
the best scientific advice, whether 
or not they are a priori opposed to 
or supportive of our views. 

Salim S. Abdool Karim, Hoosen M. 
Coovadia, Malegapuru W. Makgoba 
CAPRISA, Doris Duke Medical 
Research Institute, Nelson R. Mandela 
School of Medicine, University of 
KwaZulu-Natal, Private Bag X7, 
Congella 4013, South Africa 

e-mail: karims1@ukzn.ac.za 


When winning a 
Nobel prize seems to 
run in the family 


SIR — In his Correspondence 
‘You're the best man for this job, 
son. What a coincidence!’, Albert 
Ruggi’s suspicions about the 
process by which the offspring of 
professors are deemed to be the 
best candidates for new positions 
may well be justified (Nature 456, 
870; 2008). On the other hand, a 
few rare families just do produce 
generations of eminent scientists. 
For example, there are at least 
seven parent-child pairs of Nobel 
laureates. 

Four of these were in physics: 
the Thomsons (J. J. in 1906 and 
George in 1937), Braggs (William 
and Lawrence together in 1915), 
Bohrs (Niels in 1922 and his son 
Aage in 1975) and Siegbahns 
(Manne in 1924 and his son Kai 
in 1981). Marie Curie and her 
daughter lréne Joliot-Curie both 
won the Nobel Prize in Chemistry 
(1911 and 1935), after Marie and 
her husband, Pierre, had won the 
physics Nobel in 1903. 

The Kornbergs branched out 
more (Arthur, physiology or 
medicine, 1959; Roger, chemistry, 
2006), as did Hans von Euler- 
Chelpin (chemistry, 1929) and his 
son Ulf von Euler (physiology or 
medicine, 1970). 

Jay M. Pasachoff California 
Institute of Technology 150-21, 
Pasadena, California 91125, USA 
e-mail: jmp@caltech.edu 


Ecologists should join 
astronomers to 
oppose light pollution 


SIR — In his Commentary ‘Time 
to turn off the lights’ (Nature 457, 
27; 2009), astronomer Malcolm 
Smith argues for darker skies. 
Ecologists would also do well to 
support the International Year 
of Astronomy, considering the 
potentially severe impact of light 
pollution on some biological 
systems. Moths, for example, as 
well as the bird migrations Smith 
mentions, are adversely affected 
by light pollution. 

In most instances, the origin 
of atmospheric and ecological 
light pollution is identical. So 
ecologists should back initiatives 
stimulated this year by physicists, 
to help raise awareness of the 
undesirability of light pollution 
across different disciplines. 
Josef Settele UFZ, Helmholtz Centre 
for Environmental Research, Theodor- 
Lieser-Strasse 4, 06120 Halle, Germany 
e-mail: Josef.Settele@ufz.de 


Lindauer's genius 
showed evolution in 
a simple experiment 


SIR — | was saddened to learn of 
the death of Martin Lindauer, an 
under-appreciated hero of science. 
Thomas D. Seeley, in his Obituary 
(Nature 456, 718; 2008), describes 
experiments from Lindauer's 
Communication Among Social Bees 
(Harvard Univ. Press, 1961) that 
demonstrate his talent. The book 
also includes experiments that 
possess what physicist I. |. Rabi 
used to call ‘witz’: an unexpected 
twist that elevates an experiment 
to a higher level. 

Living in an enclosed, sheltered 
space, the honeybee Apis mellifera 
performs its communicative 
dance ona vertical surface in the 
dark, using gravity as a substitute 
for the direction of the Sun. By 
depriving them of a vertical 
surface and giving them a direct 
view of the Sun, Lindauer forced 
them to revert to the more 
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primitive, Sun-directed dance 
of their dwarf Indian relative, 
Apis florea. 

In closing the gap between a 
primitive and an advanced 
condition, Lindauer possibly 
produced the best-ever 
experimental evidence for 
evolution. Scientists concerned 
with evolution of human language 
and mind might ponder his 
SUCCESS. 

William L. Abler Department of 
Geology, The Field Museum, 
1400 South Lake Shore Drive, 
Chicago, Illinois 60605, USA 
e-mail: wabler@fieldmuseum.org 


Culture clash in 
Chinese university: 
a response 


SIR — In your Editoral ‘Culture 
clash in China’ (Nature 456, 
545-546; 2008), you incorrectly 
say that |am professor emeritus, 
having retired from the College of 
Life Sciences, Peking University, 
four years ago. In fact, | retired in 
February 2006 and do not have 
emeritus status. Neither did 

| retain my laboratory there in 
order for my associate professor 
to take it over formally as a way of 
maintaining my influence. 

| have kept my laboratory 
running with the help of a grant 
from the National Natural Science 
Foundation of China (NSFC). 
When | applied to the NSFC, Peking 
University guaranteed my lab and 
equipment until | had completed 
the work. The associate professor 
you mention was a co-author on 
this grant application. 

Although | did submit an online 
posting accusing Yi Rao, the dean 
of life sciences, of withdrawing 
the laboratory for use in other 
applications (http://tinyurl. 
com/8l4u9x), | have never 
proposed that the associate 
professor should take it over. My 
aim is that he should be able to 
use it to continue his research. 
Keming Cui College of Life Sciences, 
Peking University, Beijing 100871, 
People’s Republic of China 
e-mail: ckm@pku.edu.cn 
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OPINION BEING HUMAN 


Kinship: Race relations 


Our notions of family, population and race may need revising in the age of personal 
genomics, argues Aravinda Chakravarti. 


Genealogical records are currently the system 
of choice for people tracing their family his- 
tory. But in the next decade, we will be able to 
identify many of our relatives by searching a 
DNA database of personal genome sequences. 
There are good reasons for switching to DNA: 
in general, historical records cover at most the 
past 500 years; our genomes, in contrast, bear 
the stamp of tens, if not hundreds of thousands 
of years of history. Even individuals without 
genealogical records will be able to correctly 
create a family tree with connections to known 
relatives, to those they were unaware of, and to 
relatives so distant that they stretch the mean- 
ing of the word ‘family. 

Two developments are making it possible for 
geneticists to begin homing in on the patterns 
of relatedness between the world’s seemingly 
diverse people. The first is the discovery that 
we share very recent common ancestors. Until 
the late 1980s, our early hominin ancestors, 
from a few million years ago, had been found 
in many locations, and modern humans were 
thought to have arisen from the local evolution 
of these species in different parts of the world. 

In 1987, Allan Wilson and his students pre- 
sented a new and surprising human history: their 
comparison of only a fraction of our genome — 
that found in the mitochondria — ina handful 
of Africans, Asians and Europeans, showed that 
all living humans are related via a set of common 
ancestors who lived in Africa about 200,000 
years ago. Other studies have since shown that 
the world beyond Africa was settled even more 
recently. From 100,000 years ago, descendents 
of our African forebears spread out to populate 
other continents (the New World, perhaps as 
recently as 25,000 years ago), with the lineages 
from different settler groups eventually mixing 
through further migration. 

The striking implication of this is that all 
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living humans are mosaics with ancestry from 
the many parts of the globe through which our 
ancestors trekked. In other words, each of us 
has around 6.7 billion relatives. 

The second change that is allowing geneti- 
cists to piece together human ancestry is 
remarkable progress in our ability to study 
DNA. Driven mainly by the desire to find 
genetic traits that underpin com- 
mon diseases, extraordinarily 
rapid technological advances in 


“The global picture 
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don't know the identity of their genetic father. 
Also, in the case of isolated people such as 
the Old Order Amish, all individuals share a 
small group of common founder ancestors. 
So a social relationship, such as first cousin or 
parent, is an imperfect guide to their genetic 
relatedness. 

The only way to assess biological kinship 
with any certainty is to look for 
the stories of ancestry marked 
indelibly in a person’s DNA. 


DNA and computing analysis are of relatedness that In the past decade, techno- 
allowing geneticiststocompare ISemerging from logical advances have reduced 
more than a million markers, DNA studies stands the costs of examining entire 
or variable DNA sites, in peo- to shatt f uman genomes 1,000-fold or 
ple’s genomes, and to make such Wee UIEDN be more. These have largely been 
comparisons between hundreds OUF beliefs about driven by a desire to identify 
of thousands of people. ourselves.” the genes underlying common 


The global picture of related- 
ness that is emerging from DNA studies, in the 
context of established facts about our recent 
common ancestry, stands to shatter many of 
our beliefs about ourselves. In particular, it 
calls into question existing ideas about popu- 
lations and race. 


Extended families 

Anthropologists and sociologists have 
conventionally assessed kinship by asking peo- 
ple about their social relationships with oth- 
ers. This provides a notoriously incomplete, 
perhaps even erroneous, picture of biological 
relatedness for living humans, and little more 
than myth or speculation when used to assess 
the connections between assumed forebears. 
As the saying goes, “Maternity is a matter of 
fact, paternity a matter of opinion.” In the 
United States, findings from organizations 
that assess paternity, for example to persuade 
fathers to support their disputed yet biological 
children, suggest that at least 1 in 20 people 
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chronic diseases or adverse 
drug reactions. Already, more than a million 
marker sites in the human genome, have been 
examined in some 100,000 people to identify 
more than 300 novel common disease factors. 
And, increasingly, researchers are sequencing all 
3 billion DNA base pairs of the human genome; 
one international study, the 1,000 Genomes 
Project, aims to sequence the genomes of 1,000 
people over the next two years. 

However, the interactions between myriad 
genetic and environmental factors seeming to 
underpin most common diseases are proving 
to be highly complex. As a result, large-scale 
comparisons of people’s DNA may be bring- 
ing geneticists closer to understanding how 
the world’s people are related to one another, 
than to establishing the causes of common 
diseases. 

DNA studies scour the genomes of 
‘unrelated’ patients for common genetic 
patterns that are absent in other ‘unrelated’ 
people without the disease. For example, my 
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from genetic studies is that we are § 
all multiracial, related to each 2 
other only to a greater or lesser ° 


colleagues and I have examined the genomes 
of 16,000 people to identify a region on chro- 
mosome 1 that harbours a gene affecting the 
risk of sudden cardiac death (in combination 
with many other genetic and environmental 
factors). But our finding that markers in this 
region are over-represented among patients 
compared with those who are disease-free 
indicates that the disease arises, in part, from 
shared ancestry. In other words, the disease 
runs in families, even though the family links 
may be thousands of years old. 

Even putting aside the disease factor, we 
can uncover both the proximal and remote 
ancestral relationships of any two of these 
16,000 people by comparing the degree and 
pattern of similarity across their genomes. 
Indeed, comparisons of millions of markers, 
and certainly of entire genomes, will identify 
far more specific relationships between stran- 
gers than has been uncovered by the ancestry 
tests now in vogue. 

Increasingly, customers pay companies to 
convert their DNA into ancestry information. 
But most if not all such pictures of relatedness 
are based on markers on the mitochondrial 
genome (inherited only from mothers) or on the 
Y chromosome (inherited only from fathers). 
These represent a minute fraction of our genetic 
inheritance (less than 1%), so give a highly 
incomplete picture of relatedness. Companies 
also tend to compare people’s DNA to sequences 
held in their own private databases, which are 
currently too small to uncover more than the 
continental origin ofa person’s ancestors. 

It is not inconceivable that studies inves- 
tigating the genetic basis of diseases will 
reveal people’s previously unknown cous- 
ins, siblings, half-siblings or even parents. 
Human geneticists are bound by consent 
forms not to reveal cases of mistaken pater- 
nity if they discover them. But if databases 
of DNA sequence information become pub- 
licly available, just as genealogical records are 
now, people will be able to compare their own 
genome sequence with those of millions of 
others. Children born to mothers artificially 
inseminated by an anonymous donor could 


potentially discover their numerous half- 
siblings. Even in the case of remote relation- 
ships, people may interact, perhaps through 
online social networking, with newly found, 
distant relatives regardless of their culture, 
politics and race. Such a scenario is increas- 
ingly plausible given people’s willingness to 
share personal information online, for exam- 
ple in social-networking websites. 


Only skin deep 

Perhaps the most striking consequence of 
more and more people having their entire 
genome examined for genetic variation is the 
blurring of our concept of discrete human 
populations. Current thinking, championed 
by anthropologists and buttressed by old 
genetic data, is that human populations are 
intact groups that have had their own language 
and culture for eons. In fact, the population 
is thought to define an individual’s genetic 
identity, and kinship between 


extent. More detailed data on 
genetic variation, along with an 
improved sampling of humanity, 
are showing continuity in variation 
across the globe, not abrupt transitions 
between population-specific sequence pat- 
terns. Differential population growth, about 
10,000 years ago, based on the evolution of 
agriculture, technology and politics seems to 
have made sparse isolates of our species into the 
‘major’ groups of today. In other words, except 
for immigrants, kinship between two humans 
seems to be directly related to the geographical 
distance between their birthplaces. 

An even clearer, and unbiased, picture of 
humanity’s genetic diversity and relation- 
ships would emerge if geneticists focused on 
individuals instead of populations. This may 
involve sampling humans randomly across a 
grid, and then assessing their individual and 
group features (such as birth place, parental 
birth places, language and group affiliations). 
Genome-wide studies carried out in this way 
could result in individual identity and kinship 
coming to define populations rather than the 
other way around. We could test once and for 
all whether genetic race is a credible concept. 

This would be tremendously exciting. It is 
bound to stir up our deeply held notions of 
who we are, where we came from, our history 
and thus our politics. More often than not, the 

views of society have shaped 


individuals is considered only “Weareall science rather than the other 
within the context of this or that siete way around. In this instance, 
group. It’s within this context that multiracial, related it may be time for science to 
people trace their ancestry using tO each other only reshape the views of society. By 
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that the discovery that President lesser extent." and population, we may better 


Barack Obama is related to Dick 
Cheney makes news. 

Currently, the population view dominates in 
genetics because researchers sample clusters of 
individuals from distantly related groups. The 
clearly observable, or measurable, physical and 
genetic differences between people are espe- 
cially marked when people from the periph- 
eries of the spectrum of human variation are 
compared — so, for example, when Africans 
are compared with Europeans or Asians. 

Race has long been a socio-political 
construct. But by focusing on the effects of 
natural selection on genes whose effects are vis- 
ible, and sampling people from the extremes of 
human diversity, geneticists have unwittingly 
(and sometimes wittingly) added credence to 
society's views on separateness by genetically 
characterizing racial categories. 

However, the current picture emerging 
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appreciate our common, shared 
and recent history, and perhaps 
more importantly, our shared future. Overhaul- 
ing such concepts in the light of genetic research 
is particularly important if we are to accommo- 
date the changing face of human groups around 
the world thanks to increased immigration to 
distant lands. Such migration has happened 
before, as our genomes show, only slowly, over 
150,000 years. a 
Aravinda Chakravarti is at the McKusick- 
Nathans Institute of Genetic Medicine, Johns 
Hopkins University School of Medicine, Broadway 
Research Building, Room 579, 733 North 
Broadway, Baltimore, Maryland 21205, USA. 
e-mail: aravinda@jhmi.edu 


See http://tinyurl.com/8ub4f4 for further 
reading. For more on Being Human, see 
www.nature.com/nature/focus/beinghuman. 
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Evolution’s challenge to genetics 


Do conjoined twins and two-legged goats suggest a minor role for genetics in evolution? The evidence is not 
strong enough to upset the orthodox view, argues Jerry A. Coyne. 


Freaks of Nature: What Anomalies Tell Us 
About Development and Evolution 

by Mark S. Blumberg 

Oxford University Press: 2009. 344 pp. 
£12.99, $22.95 


In 1980, an evolution meeting 
at the University of Sussex, UK, 
featured several speakers who 
questioned the importance of 
genetics in understanding evo- 
lution and development. The ‘structuralists’ 
saw the adaptations of many organisms as 
products of self-organizing molecules rather 
than natural selection. Others touted the 
epigenetic view, claiming that important evo- 
lutionary change involved heritable features 
not coded in the DNA. Perturbed, the distin- 
guished embryologist Lewis Wolpert stood up 
and proclaimed that he too had a radical and 
heretical view: “Genes control development.” 
Wolpert was puckishly defending what he saw 
as a perfectly adequate paradigm against those 
who minimized the importance of genes. To 
emphasize his point, he switched the lights 
on and off during the coffee break — but the 
structuralists refused to admit that the switch 
controlled the lights. 

The past three decades have vindicated 
Wolpert. Virtually all of the major advances in 
evolutionary developmental biology, or ‘evo- 
devo, have been firmly grounded in genetics. 
These include studies — two of them awarded 
a Nobel prize — on how genes organize body 
plans, how genes are regulated and how the 
same genes, such as PAX6, are recruited in the 
independent evolution of similar structures in 
different species. 

But as evo-devo blossoms, the anti-genetics 
strain persists. Evo-devo is undergoing the kind 
of spasm experienced by palaeontology when 
Stephen Jay Gould and others decided that 
their field had been overlooked in the modern 
evolutionary synthesis, or worse, forced to con- 
form to the theories of population geneticists 
and others at the heart of the synthesis. Gould 
suggested that palaeontology, when uncoupled 
from its overbearing cousins, would provoke 
a major revision of our understanding of the 
evolutionary process. Gould was wrong: the 
neo-Darwinian perspective emerged unscathed 
from its encounter with punctuated equilib- 
rium. Nowitis the turn of evo-devo to challenge 
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It is difficult to see how developmental quirks are relevant to evolution — even if you have two heads. 


neo-Darwinian orthodoxy. As Mark Blumberg 
declares in Freaks of Nature, evolutionary biol- 
ogy should not just describe developmental 
phenomena, but should also incorporate new 
evolutionary processes that de-emphasize the 
role of genetics. 

By presenting a parade of animal ‘freaks’ — 
mutants, developmental anomalies and weird 
species — Blumberg imparts lessons that, 
although familiar to biologists, will be valu- 
able to non-specialists. He emphasizes that the 
complex process of development can be unrav- 
elled by understanding how such anomalies are 
produced. Conjoined twins, for example, tell 
us something about how identical twins form. 
He highlights, correctly, that there is no direct 
correspondence between genes and traits: 
there is no such thing, for example, as the gene 
for the thumb, although some mutations can 
create new thumbs. He also stresses that evolu- 
tion can work only by modifying development, 
and that natural selection can create individu- 
als whose development responds adaptively 
to the environment — as in the case of rotif- 
ers, some species of which develop spines in 
response to predators. 

Blumberg illustrates his points with clear 
and intriguing examples. We learn that female 
hyenas lack vaginas but have huge, penis-sized 
clitorises through which they copulate and give 
birth, often resulting in high infant mortality. 
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We meet Abigail and Brittany Hensel, 18-year- 
old conjoined twins from Minnesota — two 
heads on a single body — whose touching 
story can be seen on YouTube (see http:// 
tinyurl.com/26bpm9). And we discover that 
the nervous system of rabbits can be entrained 
to make them walk rather than hop, implying 
that hopping is not genetically encoded but a 
by-product of the rabbit’s leg structure. 

Blumberg’s ambitions transcend storytelling: 
he aims to show that developmental biology 
has made real contributions to evolutionary 
theory. The theory’s problem, as Blumberg 
maintains, is its “gene-centered, population- 
level thinking’, also described as “simplistic 
single-cause, gene-centered thinking” What 
paradigms, then, should supplant our mis- 
guided embrace of Gregor Mendel? 

The first is epigenetics. Blumberg notes 
that larger male dung beetles roll larger balls 
of dung, which in turn nurture larger sons. 
He argues that “Such examples of nongenetic 
transmission of characters are now becom- 
ing commonplace and are helping solidify the 
notion that the heredity upon which evolution 
depends is more than just about genes.” But 
we must be careful. Some adaptive ‘epigenetic’ 
phenomena, such as parental imprinting of 
chromosomes, which influences gene expres- 
sion depending on which parent passed on the 
gene, are based on instructions in DNA. Other 
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cases of epigenesis, such as the conformational 
changes of prion proteins, are of minor evo- 
lutionary significance. Still others, such as an 
ancestral cell's ingestion of the bacteria that 
evolved into mitochondria, were of immense 
importance in evolution but are infinitely 
rarer than adaptive changes based on genes. 
And in nearly all cases, epigenetic effects peter 
out after a few generations, unable to promote 
major evolutionary change. Perhaps the most 
serious criticism of epigenetics is that of the 
thousands of inherited mutations found in 
model organisms such as mice and fruit flies, 
virtually all reside in DNA. 

The second paradigm involves “the self-right- 
ing properties of developmental systems’, also 
known as phenotypic accommodation, which, 
says Blumberg, lead to evolutionary innova- 
tion. To support this, he trots out the two-legged 
goat described in 1942 by Dutch veterinarian 
E. J. Slijper. The goat, a developmental anomaly 
born without forelegs, learned to hop on its 
hindlimbs like a kangaroo. When the goat died 
in an accident — some say in an ill-advised 
experiment to see if it could walk downstairs — 
Slijper’s autopsy showed that it had undergone 
modifications of its spine, hindlimbs, muscles 
and neck that facilitated its bipedal hopping. It 
has been suggested that this hobbled beast is a 
model for the origination of bipedality in some 
lineages, even humans. Although Blumberg 
admits that this is unlikely — after all, there 
are perfectly good, and more parsimonious, 
Darwinian explanations for bipedality — he 
approvingly quotes anatomist Pere Alberch: 
“The regulatory capacities of an epigenetic sys- 
tem imply that any intrinsic change will trigger a 
sequence of regulatory changes to automatically 
generate an integrated phenotype.” But as this 
integrated phenotype was not based on genetic 
differences from any other goat, it could not be 
transmitted to offspring, and its relevance to 
evolution is unclear. The phenotypic changes 
in Slijper’s goat did not result from some inher- 
ent self-regulating property of development. 
Rather, they reflect an evolved phenomenon: 
natural selection has given bones and muscles 
the adaptive property of developing in response 
to the stresses they experience. 

Blumberg’s final alternative to conventional 
evolution is genetic assimilation. As with 
phenotypic accommodation, here the pheno- 
type changes before the genes. During assimi- 
lation, an initial environmental change alters 
the phenotype of many individuals, exposing 
previously hidden genetic variation that can 
then be selected. Eventually, what was an envi- 
ronmental change becomes genetic, mimick- 
ing the inheritance of acquired traits. 

Social learning is one way to start this proc- 
ess. For example, in the 1920s, two species of 


British tits learned by mass imitation to pry up 
the foil on milk bottles and drink the cream on 
top. Were home milk delivery still common, 
one can imagine that this propensity might have 
become genetically assimilated. Individuals 
with greater abilities to learn the behaviour, or 
perform the actions needed, would be favoured 
by selection. Eventually, cream pilfering would 
become innate — coded in the genes. Many 
adaptations might have started in this way; fish, 
for example, may have evolved adaptations for 
living on land after some individuals discov- 
ered terrestrial insects to bea rich food source. 
But we can also explain such cases by invoking 
simple selection on pre-existing genetic varia- 
tion. In the absence of a single credible example 
of genetic assimilation in nature, it remains an 
appealing but untested speculation. 


In the end, the problem with these explana- 
tions is not so much that they are wrong, or of 
no potential importance in evolution. Rather, 
it is that Blumberg gives the impression that 
they are established truths rather than hypoth- 
eses that have remained unconfirmed for three 
decades. In his anxiety to boost the status of 
evo-devo in the pantheon of evolutionary 
subdisciplines, Blumberg has short-changed 
orthodoxy. Not only does the traditional view 
of evolution explain far more than he allows, 
but Blumberg shapes his own vision of develop- 
ment to inflate its challenge to neo-Darwinism. 
I, for one, am with Wolpert. | 
Jerry A. Coyne is a professor in the Department 
of Ecology and Evolution at the University of 
Chicago, Illinois 60637, USA. 
e-mail: j-coyne@uchicago.edu 


The future is now 


“With great power comes great responsibility,” 
uttered Stan Lee’s comic-book superhero 
Spider-Man in his first published appear- 
ance in Amazing Fantasy in 1962. Since then, 
science has advanced to such a point that 
the human body can be enhanced in ways 
that mimic fiction. Cloning, face transplants, 
prosthetic limbs, brain-machine interfaces and 
cognition-enhancing drugs promise utopian 
or Orwellian visions of the future, depending 
on your outlook. It is the scientific community, 
not the superhero, that holds 


from the present. Many of the essays focus on 
society's willingness, or not, to embrace new 
technologies. Philosopher Russell Blackford 
argues convincingly that a fear of new tech- 
nologies is groundless and erodes liberalism. 
He notes that a governing state can allow new 
advances, such as choosing the sex of one’s child, 
without endorsing the underlying technology 

or morality of an individual's choice. 
The concept of human enhancement is 
investigated by ethicist Ruud Ter Meulen. Cer- 
tain drugs such as modafinil, 


the great responsibilities of Human Futures: ; used to treat narcolepsy, 
our age and the next. Art inan Age of Uncertainty have been shown to improve 
Two books, Human Futures Edited by Andy Miah cognition and are becoming 


and The Science of Heroes, use 
a crystal ball to imagine how 
science will determine the 
future of human existence 
and society. 

Human Futures is a varied 
collection of meditations on 
the notion of humanity from 
researchers, artists, philoso- 
phers and even a Blood Elf 
priest from the online role-playing game World 
of Warcraft. The book is born of the Human 
Futures programme of the Foundation for Art 
and Creative Technology, based in Liverpool, 
UK. The programme comprised a series of 
exhibitions and lectures for public debate, in 
which creative assemblages of artists, philoso- 
phers and scientists explored questions of what 
we are now, and what we will become. 

The first of the book’s four sections, entitled 
‘Visions, perhaps best succeeds at describing the 
problems of the future using intelligent insights 


368 pp. £35 
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Liverpool Univ. Press: 2008. 


The Science of Heroes: 
The Real-Life Possibilities 
Behind the Hit TV Show 
by Yvonne Carts-Powell 
Berkley Boulevard Books: 
2008. 288 pp. $15 


increasingly popular with 
students revising for exams. 
If everyone takes the drug in 
future, then what is the new 
norm? Will it make us better, 
or just different? 

Physicist Richard Jones 
asks how humans have been 
enhanced by technology. 
He observes that the public 
accepts the benefits of gadgets while increas- 
ingly rejecting the scientific world view. As a 
result, he explains, per capita energy use in the 
United Kingdom has risen from 20 gigajoules 
per person in 1800 to nine times that figure 
today. Life expectancy, he notes, is strongly cor- 
related with energy use. Such a rise in energy 
use looks unsustainable at present, but Jones 
asserts that technology is a product of society, 
not a runaway automaton, and solutions will 
come as longas the energy flows. 

Oddly, the section concludes witha delightful 
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Many of the ‘superpowers’ portrayed in the television series Heroes are no longer science fiction. 


but misplaced essay on ‘evidence dolls; the 
creation of designers Anthony Dunne and 
Fiona Raby. The plastic miniatures are a 
hypothetical future product in which to store 
sperm and hair samples of prospective repro- 
ductive partners. The dolls are personified by 
four women, who reveal how knowledge of 
their partner’s genes might influence their 
sexual and reproductive lifestyles. 

Two other essays that centre on human 
enhancements — such as the lower-leg pros- 
theses that allow South African runner Oscar 
Pistorius to compete with able-bodied athletes 
— are left to later sections. This misfiling of 
the book’s content makes its arguments and 
themes hard to follow. Sociologist Steve Fuller’s 
history of humans playing God is a thoughtful 
reminder of how badly we have handled aspects 
of divinity in the past, but it belongs nearer to 
Pramod Nayar’s narrative on post-human 
rights. Both essays grapple with the prospect of 
endowing legal and moral status to our cyber- 
netic or genetically enhanced descendants. 

The book's over-designed layout — a hybrid 
of uber-stylish photography mixed with ele- 
ments of the record sleeve from Radiohead’s 
seminal album OK Computer — does not aid 
the reader experience. Indeed, the voice of 
musicians is absent, which weakens the book’s 
proclamations of diversity. 

The Science of Heroes explores similar themes, 
but in avery different style. It uses the vehicle of 
Heroes, the popular sci-fi television series that 
follows the moral chaos inflicted on a clutch of 
people who have incredible powers proportional 
to their youth and good looks. In the book, 
Yvonne Carts-Powell propels the reader on an 
enthusiastic and entertaining journey through 
the realms of biology and physics that might one 
day produce the first genuine super people. 

And that day might come sooner than we 
think. The regenerative powers of Heroes 
character Claire are ones we already possess — 
we are just slower to heal. Hiro’ teleportation 
has been achieved, at least in the subatomic 
world, and metamaterials with negative refrac- 
tive indexes promise invisibility for all. Other 
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powers, such as the ability to steal memory, are 
already with us in the form of drugs that help 
victims of trauma to forget. 

Historically, superheroes are a snapshot of 
the relationship between science and society 
at any one time, based on their powers and 
how they obtained them. From the perfect 


Superman born of a 1930s United States 
adopting eugenic practices, to the nuclear- 
powered Spider-Man of the cold-war era and 
the psychologically disturbed Batman incar- 
nations of the therapy-obsessed 1980s and 
1990s, every generation has its super people. 
Those portrayed in Heroes are the genetically 
enhanced Generation X of the superhero 
world — too busy to save the world because 
their own lives are in perpetual turmoil. 
Societies get the superheroes they need 
most. Human Futures and The Science of 
Heroes give a tantalizing glimpse of how 
science might make this a reality in a future 
human existence. And given Carts-Powell’s 
talent for explaining the science, perhaps an 
army of her clones will take over high-school 
education. a 
Arran Frood is a science writer based in the 
United Kingdom. 
e-mail: arranfrood@gmail.com 


Is there life on Europa? 


Unmasking Europa: 

The Search for Life on Jupiter's Ocean Moon 
by Richard Greenberg 

Praxis/Springer: 2008. 278 pp. 
£17.50/$27.50 


In the field of astrobiology, the discovery of life 
beyond Earth sits like a gem inside the nested 
Russian dolls of physics, geology, chemistry 
and, ultimately, biology. Efforts to understand 
the habitability of worlds within our Solar 
System began with physical and astronomical 
surveys, and have now moved on to the chal- 
lenge of cracking open the geological secrets 
of key destinations such as Mars and the large, 
icy moons of Jupiter and Saturn. 

Understanding the geological context for 
life is critical. Rock cycles, whether they are of 
silicates or ices, enable chemical cycles that can 
then be exploited by biological systems. Such 
cycles are central to life on Earth. On Mars, the 
demise of mantle convection may have led to the 
planet becoming cold and dry. Near the giant 
planets of the outer Solar System, and perhaps 
around massive extrasolar planets, rock cycles 
may be driven by the gravitational squeezing 
of icy moons due to tidal interactions. On icy 
moons such as Jupiter’s Europa, the mixing of 
irradiated, oxidant-rich surface ice with a water 
ocean could maintain a chemically rich envi- 
ronment capable of sustaining life. 

In Unmasking Europa, planetary scien- 
tist Richard Greenberg details in depth our 


© 2009 Macmillan Publishers Limited. All rights reserved 


geological understanding of the tidally 
tormented icy surface of Europa. Without pull- 
ing any punches, he also describes the equally 
tormented scientific debate that has led to the 
current canon. More than a decade after the 
Galileo spacecraft returned magnetic-field and 
gravity data that strengthened the case for a sub- 
surface, liquid-water ocean on Europa, we still 
do not know whether that ocean lies beneath 
an ice shell just a few kilometres thick or a shell 
with a thickness of more than ten kilometres. 

From an astrobiology perspective, a thin 
shell could permit direct cycling of oxidant-rich 
ices with the ocean. A thick ice shell, however, 
would impede the cycling of surface material, 
possibly limiting the chemical energy available 
to any life below the surface. On this conten- 
tious debate over the ice thickness, Greenberg 
notes, “by itself, modelling of heat transport on 
Europa is too uncertain to definitively discrimi- 
nate between thin conductive or thick convec- 
tive ice”. However, on the basis of a host of 
geological features observed in images from the 
Voyager and Galileo missions, many of which 
are reproduced in the book, Greenberg argues 
compellingly that only a thin shell is consistent 
with the observed ridges, cycloidal features and 
chaotic terrain of Europa, all of which can be 
explained through tidal dynamics. 

Although Greenberg occasionally strikes an 
acerbic tone when describing scientific differ- 
ences with those on what he calls the thick- 
ice bandwagon, his motivation seems noble. 
He fears that “the most brilliant young minds 
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may leave science if they perceive it to reward 
something other than good research” He feels 
that the data point towards a thin ice shell but 
that political powers have marginalized this 
interpretation and those scientists who advo- 
cate it. When discussing his own work, Green- 
berg generously bestows much credit on his 
former students, postdocs and colleagues. 

In Unmasking Europa, Greenberg succeeds 
in conveying a story, not of heroes and vil- 
lains, but about the rise and fall of ideas and 
how some become accepted for reasons that 
perhaps go beyond empirical support. In 
Greenberg's earlier work, Europa the Ocean 
Moon (Springer, 2005), which is of similar 
scope but targeted to a research audience, the 
political storyline is not particularly appropri- 
ate. In his latest work, he delivers an accessible 
and well-laid-out popular-science treatment 
in which the political narrative is more perti- 
nent, although obviously biased towards his 
own perspective. Greenberg uses humour to 
balance out the tone, as in his suggestion that 
the reader should buy a second copy of the 


Europa’s fractured icy surface 
could conceal life beneath. 


book just to cut out the 
images and do the geolog- 
ical reconstructions while 
reading the first copy. 
Tides are the recurring 
theme of Greenberg's treat- 
ment — they “connect the 
orbits of Jupiter's moons to 
the geology of Europa, creating 
environments potentially suitable 
for life”. Only one short chapter deals 
with the possibility of a biosphere; more 
detail on the known chemistry of Europa 
would have been welcomed. Nevertheless, his 
treatment of tidal dynamics is thorough. 
Europa has not yet revealed a smoking gun, 
as have the icy plumes of Enceladus, to indicate 
that it is geologically active today. This has left 
the planetary geology community staring at the 
limited imagery of Europa, wondering what 
its surface features reveal about the interior. 


OPINION 


Centuries ago, geologists 


formitarian mantra of ‘the 
present being the key to the 


of planetary science, with 
data streams punctuated 
by missions that are all too 
rare, we often find ourselves 
struggling to decipher the 
geological present, much 
less the past. 
Unmasking Europa pro- 

vides a comprehensive and 
engaging account of Europa’s 
past and present, and sets the 
stage for the many questions that 
will be answered by future missions as we con- 
tinue our search for life beyond Earth. a 
Kevin P. Hand is a scientist at NASA's Jet 
Propulsion Laboratory, California Institute of 
Technology, 4800 Oak Grove Drive, Pasadena, 
California 91109, USA. 
e-mail: knand@jpl.nasa.gov 


Scripting scientists’ lives 


Leave a Light On 

Ensemble Studio Theatre, New York City 
22 January 2009. Part of the First Light 
Festival, which runs until March 2009. 


Last year, at a first reading of her play about 
the life of biologist Robert Trivers, Ann Marie 
Healy noticed a stranger in the back of the 
theatre, laughing. Afterwards, the man strode 
over to the actor who had played the young 
biologist as a foul-mouthed and promiscuous 
genius working out the evolutionary logic of 
human kindness and conflict, and said: “You 
got it exactly right” That stranger was Trivers. 

Healy’s play features in New York's First Light 
Festival, a collaboration between the Ensemble 
Studio Theatre and the Alfred P. Sloan Founda- 
tion that incubates science-based theatre. The 
festival, which has run annually for more than 
a decade, includes nine full-length plays this 
year and continues until the end of March. 

In Leave a Light On, Trivers is portrayed as 
an ambitious, untenured professor who ruffles 
feathers at Harvard University’s department 
of zoology as he attempts to take a Darwinian 
approach to human nature. Dissatisfied with 
an academic culture that is hostile to his ideas, 
Trivers retreats to Jamaica to study lizards 
and then moves to teach at the University of 
California, Santa Cruz, where he meets Huey 


Newton, former leader of the Black Panther 
party. Widely believed to have dropped ‘off the 
grid’, Trivers returns to academia more than 
a decade later to study the adaptive value of 
self-deception. 

Healy weaves in the science with a light touch. 
In the play, with the help of a female colleague 
who is also a love interest, Trivers works out 
his theory of reciprocal altruism using a series 
of imaginary birds with distinct approaches to 
selfless behaviour: Suckers, who always groom 
their peers; Cheaters, who never do; and Grudg- 
ers, who only groom tit-for-tat. As in Tom Stop- 
pard’s play Arcadia, the script darts between 
centuries and characters, punctuating Trivers's 
sobering career with farcical episodes from the 
courtship of Charles and Emma Darwin that are 
meant to explain the logic of gene competition. 


Robert Trivers's ideas on behaviour caused conflict. 
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The play hardly needs such asides: Trivers’s own 
ideas are enough to drive the plot. 

The 2009 First Light Festival began with an 
uneven selection of one-act plays collectively 
called E = mc”"", portraying a chemist dis- 
covering her brother’s meth lab, an Olympic 
gymnast trying to prove her rivals are under- 
age, and a mathematician confronting risk in 
an airport restaurant. The full-length plays take 
on an equally wide range of topics. Anna Zie- 
gler’s Photograph 51 portrays the familiar story 
of biophysicist Rosalind Franklin, whose X-ray 
diffraction images led the way to the discov- 
ery of DNA structure in 1953. Tommy Smith's 
Beautiful Night will show Soviet inventor and 
electronic-music pioneer Léon Theremin falling 
in love with a black ballerina in New York City 
in the 1930s — with live accompaniment from 
the eerie-sounding theremin instrument. And 
in the improbable monologue Five Easy Steps to 
Metaphysical Fitness: They Actually Work, come- 
dian Emily Levine will impart wisdom gained by 
staging a one-woman show about physics while 
struggling with her pituitary-gland disorder. 

“The goal is not just to demystify science but 
to showits intrinsic appeal, both emotional and 
intellectual,” says Darcy Kelley, a neurobiologist 
at Columbia University and an adviser to the 
theatre. “Then science itself becomes a charac- 
ter, not just window dressing” a 
Jascha Hoffman is a writer based in New York. 
e-mail: jascha@jaschahoffman.com 


See http://tinyurl.com/7otvba for more details on 
the First Light Festival. 
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Pre-emptive blood flow 


David A. Leopold 


Electrical signalling among brain cells summons the local delivery of extra blood — the basis of functional 
brain imaging. Yet sometimes, blood is sent in anticipation of neural events that never take place. 


The brain makes up only 2% 
of the human body mass, but 
because of its high energy 
demands it receives more 
than 15% of the cardiac out- 
put. When blood enters the 
brain, it doesn't course indis- 
criminately through the organ’s 
vessels; instead, it is selectively chan- 
nelled to specific regions in a need-based 
fashion. On page 475 of this issue, Sirotin and 
Das’ show that, sometimes, blood is sent to 
the brain’s visual cortex in the absence of any 
stimulus, priming the neural tissue in appar- 
ent anticipation of future events. 

We know a lot about the physiology of the 
cerebral cortex, the folded sheet of densely 
packed grey matter that forms the outer sur- 
face of the brain. Its neurons generate electrical 
impulses that carry information about stimuli 
and events and transmit it to other neurons. 
Arteries, arterioles and capillaries deliver 
fresh blood to neurons, and supporting glial 
cells surround both neurons and blood vessels, 
regulating blood flow and performing various 
housekeeping roles. 

Housekeeping in the brain is a challenge, 
because neurons undergo sudden bursts of 
activity that consume energy and pollute their 
surroundings. Consider, for example, what 
happens in the cortex when we first direct 
our gaze to a bright stimulus. In the visual 
cortex — the region of the cortex specialized 
for vision — thousands of previously quiescent 
neurons suddenly erupt in a cacophony of 
activity, each generating hundreds of electrical 
impulses per second. In response to the meta- 
bolic consequences of such activity, fresh blood 
is directed towards neurons and glia in active 
regions, flushing out waste products, delivering 
nutrients and restoring the local milieu. 

Brain mapping techniques such as functional 
magnetic resonance imaging (fMRI) measure 
blood flow (haemodynamics) rather than 
neural activity. The accuracy with which fMRI 
monitors neural functioning in the human 
brain depends on the precise coupling between 
neural activity and blood flow. Although details 
of how neural activity triggers changes in blood 
supply are a topic of active debate, it is generally 
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Figure 1| Behavioural modulation of blood flow. Blood flow in the visual cortex is normally modulated 
in step with neural responses to visual stimuli. Sirotin and Das’ show that the vascular response in this 
area of an alert monkey’s brain is readily modulated by its expectation of the task, even in complete 
darkness and without accompanying neural modulation. 


assumed that the two signals are tightly coupled 
in both space and time. 

Yet biology can provide exceptions to every 
rule, and Sirotin and Das' seem to have tapped 
into a big one. Their study shows that corti- 
cal blood flow can depart wildly from what is 
expected on the basis of local neural activity. 
They observed this mismatch in alert rhesus 
monkeys by simultaneously measuring vascu- 
lar and neural responses in the same region of 
the visual cortex. Changes in the blood supply 
were monitored by a sensitive video camera 
peering at the surface of the brain through a 
transparent window in the animal’s skull, and 
local electrical responses of neurons were 
measured with a microelectrode. The monkeys 
were oblivious to all of this, focusing instead on 
a behavioural task that would earn them a juice 
reward. The task required the animals to fixate 
their gaze on a tiny spot on a computer monitor 
for several seconds at a time. 

When Sirotin and Das presented the monkeys 
with a conventional visual stimulus under 
these conditions, they observed a close corre- 
spondence between neural and haemodynamic 
responses to the stimulus, as expected on the 
basis of much previous work’. The surprising 
finding came in trials without a visual stimu- 
lus, when the visual cortex should have been 
disengaged (Fig. 1). 

Here — aside from the tiny spot directing 
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the animal where to look — the task was 
carried out in complete darkness. Against all 
expectations, however, the haemodynamic 
signal continued to rise and fall. Indeed, the 
video of the cortical surface continued to reveal 
the cyclical ebb and flow of cerebral blood, 
which accompanied simultaneous changes 
in blood oxygenation and arterial diameter. 
By contrast, during this same trial, the neural 
signal fell nearly silent. Individual neurons in 
the same patch of cortex ceased to show any 
changes in their rate of generating impulses, 
and only the faint swell of background electri- 
cal activity indicated that cortical neurons were 
at all stirred by the behavioural task. 

So what might be the origin of this vascular 
priming? One possibility is that the subtle 
background swells led to increased blood 
flow through the local release of metabolic 
factors. Sirotin and Das argue, however, 
that this interpretation is unlikely because 
the robust haemodynamic responses they 
observed did not bear a reliable relationship 
to the neural signals they measured. Instead, 
this study — perhaps more than any previous 
work — highlights the potential role of direct, 
task-related neural control of vascular tone. 
Both glia and blood vessels receive signals from 
diverse types of neuron, including signals from 
neurons in brain centres controlling attention 
and arousal’. The task-based modulation of the 
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vascular response might therefore represent 
input from a part of the brain that can antici- 
pate probable neural activity in a specific brain 
region over the coming seconds, and so prime 
that region. 

But how specific might such an anticipatory 
signal be? As the authors’ founda similar modu- 
lation in other systemic markers, such as heart 
rate and pupil diameter, might the observed 
modulation reflect an overall change in the 
brain’s blood supply? A control experiment 
involving an auditory task argues against this 
possibility. Unlike the fixation task, periodic 
attention to an auditory stimulus did not elicit 
haemodynamic modulation in the visual cortex. 
This control experiment, although doing little to 
clarify the origin of the haemodynamic modula- 
tion, shows that such priming depends on the 
type of sensory stimulus that the brain expects. 

The mere mismatch between blood flow and 
neural activity per se is not a great surprise. 
Indeed, the precise relationship between 
neural activity, metabolism and blood flow has 
always been difficult to pin down. Early func- 
tional imaging studies revealed a quantitative 
discrepancy between oxygen consumption by 
an activated brain region following a sensory 
stimulus and the corresponding blood-flow 
response to that region*. Simply put, much 
larger amounts of oxygenated blood were 
delivered to an active region than were required 
on the basis of metabolic demands. Earlier 
work has also shown that the correspondence 
between neural activity and blood-based imag- 
ing signals is highly situation-dependent”®, 
highlighting the complex relationship between 
neural activity, metabolism and blood flow’”. 

Yet the neurovascular mismatch reported 
by Sirotin and Das' is extreme. The clear and 
rhythmic haemodynamic modulation in the 
visual cortex spurred by a task performed in 
complete darkness is sure to raise eyebrows 
among the human fMRI research community. 
For one thing, most fMRI experiments involve 
the periodic presentation of sensory stimuli, 
and then rely on the temporal structure of the 
haemodynamic response for deducing local 
neural activity. The present study clearly demon- 
strates that some of the assumptions underlying 
such analysis — namely, that cyclical variations 
in blood flow reflect local, stimulus-driven 
events — may sometimes be incorrect. a 
David A. Leopold is in the Unit on Cognitive 
Neurophysiology and Imaging, Laboratory of 
Neuropsychology, National Institute of Mental 
Health, Bethesda, Maryland 20892, USA. 
e-mail: leopoldd@mail.nih.gov 
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ASTROPHYSICS 


Galaxies in from the cold 


Reinhard Genzel 


Computer simulations of the cosmos suggest that cold streams of gas 
could underlie the unexpectedly high star-formation activity of many 
massive galaxies found to exist a few billion years after the Big Bang. 


Recent surveys of galaxies’ have found evidence 
that galaxies with masses comparable to 
or greater than that of the Milky Way were 
already present in large numbers about 3 bil- 
lion years after the Big Bang. What’s more, 
a significant fraction of these massive gal- 
axies seem to have been gas-rich, rotating 
disks in which stars formed at a rate of up to 
150 solar masses per year, 50 times the rate in 
the present-day Milky Way~*. Most of these 
star-forming galaxies do not seem to be the 
aftermath of mergers of smaller systems’, and 
are found to produce stars steadily over a long 
period of time* — characteristics that are at 
odds with the prevailing view of how galax- 
ies form and evolve. So, if mergers are not the 
cause, what else could trigger the formation 
of stars in these massive galaxies? Building on 
earlier work®’, Dekel et al.* (page 451 of this 
issue) use cosmological simulations’ to show 
that the galaxies might grow and form stars as 
a result of being fed by rapid, narrow streams 
of cold gas. 

In the classical picture of their formation”, 
galaxies are created when gas cools and col- 
lects at the centres of collapsing haloes of dark 
matter. They then evolve to form larger galax- 
ies by merging with smaller galaxies. But the 
fundamental properties of atomic cooling divide 
evolving galaxies into two branches. Galaxies 
with a dark-matter mass below a critical value 
of about half the mass of the Milky Way (about 
500 billion solar masses) would have grown 
rapidly through accretion of cold gas, resulting 
in sustained (or continuous) star formation. In 
contrast, massive galaxies would have grown 
at a much slower pace (determined by the 
gas cooling rate), and bursts of star formation 
would have occurred only when parent galaxies 
of comparable masses underwent intense, rapid 
mergers (known as major mergers). 

The classical picture thus predicts that active 
star-forming galaxies of low mass should 
already have been abundant at early epochs in 
the history of the Universe, whereas massive 
galaxies should have assembled later through 
mergers of smaller systems. But Dekel et al.* 
challenge this picture using substantially 
improved, high-resolution hydrodynamical 
simulations’ representing a large volume of 
the cosmos. 

Perhaps the most notable aspect of Dekel 
and co-workers’ study is the indication that, 
at early epochs (redshift = 2), haloes with a 
mass substantially above the average for that 
epoch tend to form at the dense nodes of the 
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‘cosmic web of dark matter, which comprises 
long filaments of denser gas connecting these 
nodes. As a result, much of the gas in the 
filaments remains cold and flows at high 
speed from large distances deep into the halo, 
near the evolving disk in which stars will 
subsequently form. Under these conditions, 
massive galaxies above the critical mass can 
grow rapidly and steadily. 

Dekel et al. find, however, that two other 
requirements must be met to explain the high 
star-formation rates of massive, early-epoch 
galaxies. The first is that the accretion of 
material must be largely gaseous, with only a 
small fraction of stars involved. The second is 
that the conversion of accreted gas into stars 
must be highly efficient. Because most of the 
gas in the streams is smooth or exists in low- 
mass clumps and smaller (satellite) galaxies, 
the first criterion means that the star-formation 
efficiency in the streams must be low. Quali- 
tatively, this might be plausible if massive 
stars that did manage to form in the streams 
inject energy back into the interstellar medium 
through supernova explosions and stellar 
winds. Such ‘stellar feedback’ can thus dis- 
perse the surrounding gas and halt further star 
formation in the streams, and it would do so 
more effectively in lower-mass systems because 
of their lower gravitational binding energy. In 
addition, the fraction of dense molecular gas 
required to form stars may be lower in these 
lower-mass clumps, which contain fewer 
heavy elements and have less shielding against 
destructive ultraviolet radiation from hot stars 
and the intergalactic radiation field. 

Although star formation in present-day 
galaxies is inefficient, the necessary high 
star-formation rates at early epochs might be 
reached if star formation consumes piled-up 
gas at the same rate as it is deposited. This 
requires that the fraction of gas ejected from 
the evolving massive galaxy by stellar winds 
and supernovae is modest, which might 
be at odds with observations". Alternatively, 
or additionally, a higher efficiency of star 
formation or a different distribution of 
stellar masses may be required in the early- 
epoch systems compared with those in the 
present-day Universe. Dekel and colleagues’ 
simulations cannot yet resolve the complex 
interaction between the gas inflow and the 
disk, which might answer many of the remain- 
ing questions. Much more detailed simulations 
will be required to correctly model the radia- 
tion, energy balance, dynamical structure 
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and star formation in the embryonic disks. 

It is nevertheless tempting to conclude 
that the cold streams hypothesized by Dekel 
et al.® can explain the formation of the early- 
epoch massive disks. If so, what happens next 
in the process of galaxy formation? Gas-rich, 
turbulent disks are unstable, and prone to 
fragmentation and the formation of massive 
star-forming clumps of gas”, in agreement with 
observations’. Dynamical friction then forces 
the clumps to spiral rapidly into the centre of 
the galaxy, forming a central bulge surrounded 
by a remnant disk"’, whose present-day relic 
may be the old ‘thick disk component seen 
in nearby galaxies. Gas accretion decreases 
naturally at later epochs, perhaps aided by the 
energy injection of massive black holes that 
form at the galactic centre. Disk turbulence 
then subsides, and a maturing thin disk can 
plausibly grow at redshift <1. Depending on 
whether or not a major merger occurs during 
this period, the end-product might be a mas- 
sive elliptical or disk galaxy. The work by Dekel 


and co-workers may turn out to be a decisive 
stepping stone in elucidating the origin of these 
massive galaxies. a 
Reinhard Genzel is at the Max Planck Institute 

for Extraterrestrial Physics, 85748 Garching, 
Germany, and in the Department of Physics, 
University of California, Berkeley, USA. 

e-mail: genzel@mpe.mpg.de 
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STRUCTURAL BIOLOGY 


Actin ina twist 


Kenneth C. Holmes 


How monomers of the cytoskeletal protein actin join to form the stable 
polymers crucial to muscle contraction and cellular motility has beena 
long-standing question. A state-of-the-art approach provides an answer. 


The actin protein is abundant in all eukaryotic 
cells (those characterized by a membrane- 
bound nucleus), and is particularly prevalent 
in muscle, where it comprises about 20% of 
the total mass. Actin comes in two forms: 
monomeric, globular G-actin; and polymeric, 
filamentous F-actin. F-actin forms through 
polymerization of G-actin, a process that has a 
central role in cell motility. In non-muscle cells, 
for example, polymerization of F-actin induces 
cell movement by pushing structures such as 
membranes forwards’. Also, when one stands 
or walks, all of the tension causing muscle con- 
traction is produced in muscle fibres by F-actin 
filaments interacting with the motor protein 
myosin”. Although the atomic structure of 
G-actin has been known for almost 20 years’, 
structural details of the F-actin monomer — 
which is similar, but not identical, to G-actin 
— have remained elusive. Reporting on page 
441 of this issue, Oda et al.° derive a high- 
resolution structure of F-actin from analysis 
of X-ray fibre-diffraction patterns to elucidate 
the transition from G- to F-actin. 

The structure of G-actin has been deter- 
mined independently more than 30 times. 
These data show that G-actin is a rather flat 
molecule built from two similar major domains 
(outer and inner) related by a pseudo-dyad, 
with a molecule of the nucleotide ATP, or its 


hydrolysed version ADP, bound between them 
(Fig. 1a). The terms outer and inner refer to the 
position of these domains in the F-actin struc- 
ture. Electron-microscopy data® have shown 
F-actin to be made of two chains that turn 
gradually round each other to form a right- 
handed ‘long-pitch’ helix. The inner domain 
is closer to the axis of this helix. 

By contrast, solving the structure of F-actin 
has been challenging. Despite a plethora of 
modifications to the G-actin structure — 
including the use of various ATP analogues, 
drug binding, capping proteins or crosslink- 
ing to make the G-actin look like F-actin — all 
of the structures derived have been essentially 
the same as that of the G-actin monomer, and 
none has yielded the secret of the G- to F-actin 
transition. 

The most detailed data on the structure of 
F-actin came from X-ray diffraction of arrays 
of this filamentous protein oriented in a liquid- 
crystalline gel’. Diffraction from an oriented 
gel gives a fibre diagram — a section through 
the diffraction pattern ofa single fibrous mol- 
ecule that has been averaged by spinning it 
around the fibre axis (cylindrical averaging). If 
the molecule has periodic repeats, as in F-actin, 
then the fibre diagram consists of a series of 
lines called layer lines. The best way to inter- 
pret such a diagram is to compute it from a 
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Figure 1| Monomer versus polymer. a, G-actin 
monomers consist of two similar domains: outer 
(grey) and inner (blue). The names relate to the 
position of each domain in the F-actin helix — 
the inner domain is closer to the helix axis. The 
bound ADP molecule (magenta) is sandwiched 
between the inner and outer domains. (Data 
from ref. 9.) b, Oda et al.° provide an atomic 
model of the F-actin helix. This helix repeats 

in six left-handed turns (measuring 35.7 
nanometres). Each repeat contains 13 molecules 
so that the first molecule of each repeat 
(arrowed) is in an identical orientation. Because 
the rotation per molecule (167°) is close to 180°, 
the F-actin structure appears as two long-pitch 
helices slowly winding around each other. Each 
of the 14 G-actin molecules of the helix is shown 
in a different colour, apart from the grey—blue 
molecule that is shown in two colours for 
comparison with a (asterisk). 
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50 YEARS AGO 

With the co-operation of the 
West Wales Field Society, 

the Nature Conservancy is 
purchasing the island of Skomer 
as a National Nature Reserve. 
Skomer, 722 acres, is the 

largest of the islands off the 
Pembrokeshire coast ... Each 
year great numbers of sea-birds 
breed on Skomer, as well as a 
strong colony of the Atlantic grey 
seal. Probably the most notable 
breeding species is the Manx 
shearwater ... The puffin colony 
is, next to that of St. Kilda in the 
Hebrides, probably the largest 

in the British Isles. The island is 
also well known for the Skomer 
vole, which differs from the 
common bank vole in its larger 
size, extreme tameness and 
brighter colour. Skomer is being 
leased to the West Wales Field 
Society, which made an extensive 
field survey of the island in 1946 
... Scientific investigations on the 
reserve may be arranged with the 
Regional Officer for South Wales 
of the Nature Conservancy. 

From Nature 24 January 1959. 


100 YEARS AGO 

Everyone who is working at 
radio-activity at the present 
time feels the need of a 

standard of activity in terms 

of which all measurements of 
activity can be expressed. It 

was suggested three years ago 
by Prof. H. N. McCoy ... that the 
activity of one square centimetre 
of a layer of suitable thickness 

of uranium oxide, U;0;, would 
furnish an excellent standard. In 
the December (1908) numbers 
of the American Journal of Science 
and of Le Radium Prof. McCoy 
gives an account of the work he 
has done... to show that such 

a layer has all the properties 
required in a standard. The oxide 
is easily prepared, and samples 
prepared from three different 
sources gave identical results. 

A layer of thickness such that 
0.02 gram goes to the square 
centimetre gives the maximum 
activity due to the a rays. The 
radiation due to the B rays 

is small. 

From Nature 21 January 1909. 
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Figure 2 | The secret of G- to F-actin transition. a, The inner (blue) and outer (grey) domains of a single 
actin molecule as seen in Figure 1a. Traces of the local rotation axis and the helix axis are also shown. 

b, c, G-actin (b) and F-actin (c) as seen at approximately right angles compared with a, looking along 
the local rotation axis. Note that, in passing from G-actin to F-actin, the outer domain rotates by 20° 
with respect to the inner domain. Consequently, the F-actin structure is substantially flatter. 


starting model and then use a refinement 
process to modify the model and arrive at a bet- 
ter fit with experimental data. Unfortunately, 
cylindrical averaging causes loss of informa- 
tion. Furthermore, thermal movement of the 
molecules in the liquid-crystalline sample 
causes them to become disorientated, which 
leads to a smearing out of the layer lines. So 
success depends on preparing the sample with 
the best possible orientation — that is, with all 
molecular axes as parallel to the fibre axis as 
possible — and investigating it at the highest 
possible resolution. 

Oda et al.’ used an intense magnetic field to 
improve the orientation of their sample, and 
a highly collimated, intense X-ray source to 
collect data. They started with a model made 
by placing the crystal structure of the G-actin 
molecule in the best orientation in the F-actin 
helix, rather like the original structure of 
F-actin that my colleagues and I proposed’. 
They then calculated the low-energy vibra- 
tional modes of the G-actin monomer and 
selected the combination of modes that best 
fitted the fibre diagram. 

Using this improved structure, they repeated 
the procedure. When no further improvements 
could be made, the authors turned to simu- 
lated annealing. In this molecular dynamics 
procedure, a molecule is heated for a few pico- 
seconds to agitate the atoms so as to sample 
a range of possible structures. The molecule 
is then slowly cooled, with the fit to the fibre 
diagram acting as a pseudo-force to steer the 
process to the correct structure®. So Oda et al. 
finally achieved a very good fit to the fibre 
diffraction pattern. 

Despite the complexity of this procedure, the 
authors’ F-actin structure° (Fig. 1b) is convinc- 
ing in its simplicity. The transition from G- to 
F-actin seems to involve a 20° rotation of the 
outer domain with respect to the inner domain 
about a rotation axis roughly at right angles 
to the helix axis (Fig. 2). In G-actin the two 
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domains are related by a propeller-like twist. 
The 20° rotation reduces this twist and flattens 
the molecule. Apart from this rotation anda 
reorientation of a flexible loop at the top of 
the outer domain, no other substantial change 
seems to occur. 

Actin polymerization, essential for cell 
motility, is driven by ATP hydrolysis. Whereas 
G-actin cannot hydrolyse ATP, F-actin can. Oda 
et al. find that one effect of the 20° rotation is 
to bring an evolutionarily conserved glutamine 
residue at position 137 — which is implicated in 
the ATP hydrolysis mechanism — closer to the 
B- and y-phosphate groups of the ATP mole- 
cule. So the rotation may be the switch for turn- 
ing on the ATP-hydrolysing activity of F-actin. 
In addition, the flattening of monomers within 
F-actin substantially alters the site on this pro- 
tein to which the muscle protein myosin binds 
— an interaction essential for muscle contrac- 
tion; this could explain why myosin binds with 
high affinity to F-actin but not at all to G-actin. 
Thus, the new structure will certainly become 
an essential ingredient in our understanding of 
cell motility and muscle contraction. 

Finally, the flat F-actin is very much like a 
bacterial analogue of actin called MreB. So 
Oda and colleagues’ structure also lends sup- 
port to the idea that actin is a bridge between 
eukaryotes and prokaryotic organisms such 
as bacteria. a 
Kenneth C. Holmes is at the Max Planck Institute for 
Medical Research, D69120 Heidelberg, Germany. 
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Shifts in season 


David J. Thomson 


It's cold in winter and hot in summer. But the latest analysis illustrates the 
need to put observational data at the forefront of attempts to achieve a 
more detailed understanding of the annual temperature cycle. 


It has been known for more than a century! 
that increasing the concentration of carbon 
dioxide in the atmosphere results in an 
increase in Earth’s surface temperature. By 
contrast, it is only just over a decade since the 
discovery that CO, levels also affect the timing 
of the annual temperature cycle*”, although the 
details remain enigmatic. 

On page 435 of this issue, Stine and col- 
leagues* describe how they have updated and 
extended earlier studies of the annual cycle**, 
using better spatial coverage and more recent 
data. They have concentrated on the temperate 
zones because of the dominant annual temper- 
ature cycle and, at least in the Northern Hemi- 
sphere, the reasonable spatial data coverage 
in those zones. As well as incorporating some 
technical improvements, the authors analysed 
the annual temperature cycle over the oceans. 

The annual cycle hastwo distinct components, 
amplitude and phase. Stine and colleagues 
conclude that the amplitude — loosely, half 
the difference between summer and winter 
temperatures — has been decreasing over 
most continental areas and increasing over the 
oceans. The phase describes the relative timing 
of the periodic (seasonal) component of tem- 
perature. For the most part, the seasons occur 
earlier over land and later over the oceans, and 
Stine et al. estimate the terrestrial phase shift 
to have been 1.7 days between 1954 and 2007. 


This shift, and the changes in amplitude, are 
highly anomalous when compared with the 
data from between 1900 and 1953, implicat- 
ing human agency as the cause. 

The common perception of the timing of the 
seasons is more complicated because it involves 
both changes in the annual cycle, discussed 
here, and the increase in average temperature. 
(See Figure 1 of the Supplementary Informa- 
tion‘ for a graphic description of the different 
effects.) For example, taking the date from 
which the temperature usually stays above 
freezing as marking the start of spring, the 
increase in average temperature, the smaller 
seasonal amplitude (which implies warmer 
winters) and the change in phase all work in 
the same direction, so the observed effect is 
large. This is well documented in studies of 
bird migrations and similar phenomena’®, 
where one finds many examples of seasonal 
patterns shifting to an earlier date by more than 
a month (Fig. 1). Phase changes have also been 
invoked to help explain problems ranging from 
the theory of palaeoclimates’ to changes in sea 
level’® and even in human mortality". 

Stine et al.‘ also compare their observations 
with the results of a suite of two dozen climate 
models used by the Intergovernmental Panel 
on Climate Change (IPCC), and the results are 
dismaying. Some of these models reproduce 
the decrease in amplitude, first shown in 1980 


(ref. 12), but none predicts, or even reproduces, 
the change in phase. I have no personal experi- 
ence with these models, so beyond a general 
scepticism about complicated models (per- 
haps best expressed by George Box’s dictum, 
“All models are wrong but some are useful”), I 
cannot say why they fail. We must remember, 
however, that although climate models incor- 
porate an amazing variety of effects and get 
many things right, they are almost certainly 
missing many more. 

As an example, in the mid-1990s I was dis- 
cussing the phase problem with members ofa 
modelling group and learned that their model 
had Earth ina circular orbit with no preces- 
sion. This was astonishing. First, we are trying 
to measure the effects of CO, to high accuracy 
— say 0.01 °C, in a system in which annual 
temperature extremes routinely exceed + 50°C. 
Second, on an ice-age timescale, the effects of 
precession are immense, strong enough to be 
used asa clock. Third, we have known that the 
orbit is elliptical since Johannes Kepler in the 
seventeenth century, and about precession 
since Hipparchus (around 150 Bc). The dura- 
tion of the instrumental temperature record is 
now 1% or 2% of the 26,000-year precession 
cycle: when trying to measure small effects it 
is unwise to ignore large ones. 

One should also note the contrast between 
the enormous computational resources used 
by the models and the relatively meagre effort 
required to analyse real data. Thus, work of 
the type done by Stine et al. is to be applauded. 
Ignoring the time required to assemble the data 
and write the programs, it probably took no 
more than a few seconds of computer time to 
show effects that were not predicted by any of 
the models. As Richard Feynman commented, 
“Tt doesn't matter how beautiful your theory is, 
it doesn’t matter how smart you are. If it doesn’t 
agree with experiment, it’s wrong” 


Figure 1| Migrating greylag geese. Spring, and associated phenomena such as bird migrations, now occur earlier than at the start of the twentieth century. 
Stine and colleagues’ data‘ quantify the seasonal shift. 
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Where do we go from here? One of many 
perplexing problems is the year-to-year vari- 
ations in phase and amplitude in temperature 
data. These variations are obvious in all of 
the long-term temperature records and are 
reasonably consistent with variations in the 
Sun’s magnetic field. We do not understand 
the subtle influences on climate exercised by 
solar effects such as the solar wind, the charged 
particles that flow out from the Sun. Obser- 
vational evidence for such a coupling has 
been accumulating for decades, through 
both palaeoclimate data’’ and studies of the 
upper atmosphere’. However, when one has 
observed the Sun’s acoustic oscillations in 
barometric pressure’, it is possibly time to pay 
attention to solar observations. 

The solar wind carries much more energy 
than is available from Edward Lorenz’s butter- 
flies, often used to ‘explaim purported chaotic 
behaviour in climate. This raises a philosophi- 
cal question, as to whether the fascination 
with ‘chaoplexology’ in climate research has 
resulted in a failure to take observations and 
statistics seriously enough. Climate may be 
formally chaotic, but so is Earth’s orbit’ and 
this has not prevented people from analysing 
it in exquisite detail. In my opinion, chaos, 
fractals, long-memory processes and their ilk 
should be invoked only when all of the various 
climate forcings have been carefully studied 
and all simpler explanations eliminated. We 
are not even close to meeting that goal. 

Finally, independent of any shortcomings in 
the models, we must remember that the obser- 
vational evidence for human influence on the 
climate system is overwhelming. Stine and 
colleagues’ paper* adds to that evidence. If 
we do not stop polluting Earth’s atmosphere, 
we may not have enough time left to develop 
models sophisticated enough to show what is 
obvious in the data now. a 
David J. Thomson is in the Department of 
Mathematics and Statistics, Queen's University, 
Kingston, Ontario K7L 3N5, Canada. 
e-mail: djt@mast.queensu.ca 
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MOLECULAR BIOLOGY 


Concealed enzyme coordination 


Elio A. Abbondanzieri and Xiaowei Zhuang 


Coordination between subunits is crucial for the proper functioning of 
multi-component molecular machines. A single-molecule study now allows 
glimpses into the mechanism used by subunits of one such machine. 


Even 2,000 years ago, Aristotle had noted that 
the whole is more than the sum of its parts. 
This maxim also holds true in the cell, where 
enzymatic proteins frequently combine to 
form multimeric complexes that allow indi- 
vidual subunits to coordinate their activities 
and so perform more difficult tasks than they 
could alone. A prominent example of such a 
complex is the ring ATPases’, in which — as 
their name implies — several subunits form 
circular complexes consisting of identical 
(homomeric) or non-identical (heteromeric) 
subunits. These enzyme complexes use energy 
released from the hydrolysis of ATP molecules 
to perform diverse cellular functions, such as 
DNA translocation, protein degradation and 
ion transport. On page 446 of this issue, Mof- 
fitt and co-workers’ provide the first direct 
measurement of a single enzymatic cycle by a 
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bs L Tag helicase 


homomeric ring ATPase, revealing an unex- 
pected form of coordination between the 
subunits. 

Subunits of the various ring ATPases can 
coordinate their activities in different ways. 
For instance, the three heterodimers of the 
F1-ATPase act sequentially, each binding an 
ATP molecule and hydrolysing it in order’ 
(Fig. la). By contrast, subunits of the L Tag hel- 
icase of simian virus 40 seem to act in concert, 
all six of them simultaneously binding then 
hydrolysing ATP molecules* (Fig. 1b). Sub- 
units of the unfoldase enzyme ClpX, however, 
are thought to act randomly, each one hydro- 
lysing ATP independently, with their activities 
probably being coordinated by the geometry of 
the complex’ (Fig. 1c). 

To investigate the coordination mechanism 
of a homomeric ring ATPase in detail, Moffitt 
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Figure 1| Proposed coordination mechanisms of ring ATPases. a, The subunits of Fl-ATPase, which 
exist as three heterodimers, bind to and hydrolyse ATP molecules sequentially. b, Subunits of the 

L Tag helicase, by contrast, function in concert, all simultaneously binding to ATP molecules before 
hydrolysing them. c, Subunits of the ClpX unfoldase seem to function semi-independently ina 
random order. d, Moffitt et al.’ describe a newly discovered two-phase coordination mechanism for 
the homomeric ring ATPase of the bacteriophage $29. Here, the subunits sequentially bind ATP 
molecules during the loading phase, and then, in a separate phase, sequentially hydrolyse ATP to 
translocate the DNA substrate. The exact timing of ATP hydrolysis is not known, and might not occur 
in conjunction with the steps. Circles indicate enzyme subunits, T denotes ATP, and D refers to its 


hydrolysis products. 
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et al.” used a high-precision, single-molecule 
assay involving dual-beam ‘optical tweezers’ 
and differential position detection®’ (Fig. 2). 
For their subject, they chose the five-subunit 
DNA-packaging machine of the bacteriophage 
virus @29. This complex is a powerful molecu- 
lar motor that can translocate the p29 DNA 
into the bacteriophage’s protein shell against a 
strong hindering force®. Previous circumstan- 
tial evidence suggested””” that the subunits of 
the @29 motor use a sequential coordination 
mechanism, packaging roughly 2 base pairs 
(bp) of DNA per ATP molecule hydrolysed. 
But as this putative step size was too small to 
be detected directly, the mechanism of subunit 
coordination remained largely hidden. This 
constraint has now been lifted, as the optical- 
trap assay Moffitt et al. report provides remark- 
able, sub-nanometre precision at a temporal 
resolution of 20 milliseconds. 

The assay produced several unexpected 
results. The first surprise came when the 
authors measured DNA packaging at a low 
hindering force of around 8 piconewtons (pN) 
and found that the motor advances in steps of 
10 bp, five times larger than the 2-bp step pre- 
dicted by bulk biochemical assays. Given the 
five-subunit structure of the motor, this dis- 
crepancy could be reconciled by suggesting a 
concerted hydrolysis model in which the motor 
binds to five ATP molecules and translocates 
10 bp in one step. Indeed, detailed statistical 
analysis of the ‘dwell times’ between steps, 
which was possible owing to the assay’s high 
resolution, indicated that several ATP mol- 
ecules must bind before the next step is taken. 
On closer inspection, however, the 10-bp steps 
seemed to be composed of smaller substeps. 
Although these substeps were too fast to be 
measured at low force, the stepping speed 
slowed substantially at higher forces (about 
40 pN), allowing the 10-bp step to be dissected 
into four 2.5-bp substeps. 

These striking observations raise two 
points. First, how are the five identical subu- 
nits of the 29 motor coordinated to make 
a four-substep — rather than a more logical 
five-substep — advance? It could be that one 
of the subunits has a role, other than directly 
translocating DNA, that is not readily detect- 
able. Alternatively, one of the subunits might 
be truly passive. The second point is that the 
non-integer step size of 2.5 bp seems to be 
intuitively at odds with our knowledge of how 
motor proteins move on periodic tracks. But 
there is no a priori rule that each subunit of 
an enzyme complex must make specific and 
identical contacts with its substrate. The inter- 
action between the 29 motor and DNA may 
be somewhat promiscuous, and the step size 
could be dictated by a conformational change 
within the enzyme, rather than by the periodic- 
ity of the substrate’s structure. 

Moffitt and colleagues’ data* therefore 
challenge the conventional models of how 
DNA-processing enzymes interact with their 
substrates. What’s more, their results suggest 
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Figure 2 | Analysis of DNA translocation by the bacteriophage 629 motor. Moffitt et al.” used a dual- 
beam optical-tweezer assay to study the packaging of DNA by the motor ATPase of 29. They attached 
the 29 protein shell with the packaging motor to a bead and the distal end of the DNA to be packaged 
to a second bead. The beads were levitated in separate optical traps, and the amount of DNA packaged 
was determined by measuring the relative distance between the beads, allowing sub-nanometre 
measurement precision with millisecond temporal resolution. 


an exquisite coordination between the sub- 
units of the @29 motor ATPase that combines 
features of both sequential and concerted 
coordination mechanisms: before any DNA 
translocation occurs, all ATP molecules must 
bind during the long dwell time preceding 
the 10-bp advance; once loaded with ATP, 
the motor hydrolyses ATP and/or releases 
its hydrolysis products (ADP and inorganic 
phosphate) sequentially to produce the four 
successive 2.5-bp substeps (Fig. 1d). 

The cooperative binding of ATP, as implied 
by this model, would seem to be at odds with 
the previously measured’° ‘Michaelis-Menter’ 
— rather than the expected sigmoidal — 
form of dependence of the DNA-packaging 
velocity on ATP concentration. Moffitt et al. 
resolve this apparent paradox with an elegant 
kinetic analysis, which shows that the expected 
simple Michaelis-Menten behaviour could 
be recovered if the binding of each ATP 
molecule is ordered in time and is separated 
by an irreversible event. This finding also indi- 
cates that each subunit must bind to an ATP 
molecule sequentially. Furthermore, temporal 
segregation between a sequential ATP-binding 
phase and a sequential motor-stepping phase 
would require coordination both between 
neighbouring subunits and across the 
entire complex. 

The two-phase coordination mechanism 
is complicated, but could be tested further. 
In previous single-molecule studies of the 
F1-ATPase, fluorescent ATP was used to 
establish that, at any given time, at least two 
binding sites on this enzyme complex are 
occupied by ATP or ADP, and that the steps 
taken by the complex are linked to ATP 
binding and the release of its hydrolysis prod- 
ucts". A similar approach could be used to 
capture the sequential loading of ATP and 
unloading ofits hydrolysis products in the p29 
motor, and to find the precise placement of the 
mechanical step in the ATP hydrolysis cycle. 
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Moffitt and co-workers’ two-phase model 
may prove to be a general mechanism appli- 
cable to other ring ATPases, thus explaining 
why some of these enzymes have a seem- 
ingly sequential coordination mechanism of 
action but also bind to several ATP molecules 
at once™’. Single-molecule approaches could 
help to unravel the coordination mechanism 
of these systems. The optical-tweezer assay, for 
example, is known to be remarkably adaptable 
for examining helicase, polymerase and trans- 
locase enzymes, which operate on nucleic-acid 
substrates. Even folded proteins have been 
pulled apart with optical tweezers'’, opening 
up the possibility of also studying unfoldase, 
protease or chaperone enzymes, which operate 
on amino-acid substrates. It will be exciting to 
learn whether such a diverse group of enzymes 
share common strategies for coordination, or 
if other, as-yet-unidentified forms of subunit 
organization exist. a 
Elio A. Abbondanzieri and Xiaowei Zhuang are 
at the Howard Hughes Medical Institute and 
Harvard University, 12 Oxford Street, Cambridge, 
Massachusetts 02138, USA. 
e-mail: zhuang@chemistry.harvard.edu 
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Daniel Carleton Gajdusek (1923-2008) 


The most outlandish and peripatetic of microbe hunters. 


Daniel Carleton Gajdusek, who won the 
1976 Nobel Prize in Physiology or Medicine 
for his discovery of transmissible dementias, 
died on 12 December 2008 in a hotel in 
Tromso, Norway. This isolated spot above 
the Arctic Circle was the winter refuge for 
American-born Gajdusek during the last 

10 years of his life. He spent his summers 
mainly in Amsterdam. This migratory 
pattern, and his choice to spend winter in 
one of the darkest places on Earth, typified 
the eccentricity of the man. 

Gajdusek was born in 1923 in Yonkers, New 
York, where his father had a butcher's shop. As 
an eight year old, he already seemed to know 
his destiny. He told me once that he inscribed 
the names of all the scientists in Paul de Kruif’s 
book Microbe Hunters — which included 
giants such as Robert Koch and Louis Pasteur 
— on the staircase to his attic chemistry lab, 
leaving the last step blank for himself. 

Gajdusek’s mind was continuously 
attracted to the mysterious and exceptional, 
his rationale being that, to contribute to 
knowledge, you must find unexplained 
phenomena and observe them first-hand. 
From the early 1950s onwards, having trained 
as a research virologist, he recorded his 
scientific endeavours on film so that he could 
share his experiences as directly as possible 
with everybody. 

In 1954, Gajdusek shot a documentary 
entitled Rabies in Man, which followed 
experiments at the Pasteur Institute of Iran 
in Tehran. The institute’s director, Marcel 
Baltazard, had recently shown that almost 
a third of people who had suffered a rabid- 
dog bite to the head could not be saved by 
rabies vaccine. Baltazard considered this 
result disastrous. Gajdusek suggested that he 
should test anti-rabies antibodies (prepared 
from rabbit serum by Herald Cox in New 
York) in combination with the vaccine. 

Baltazard agreed to this, and in August 1954 
he began using the combination therapy in 
18 patients who had sustained head wounds 
from rabid wolves. Gajdusek’s documentary 
meticulously followed their progress during 
treatment. The study showed convincingly 
that addition of rabies antibodies to the 
vaccine can completely protect people from 
disease or death after exposure to rabies virus. 
This regimen has been the gold standard of 
care for the disease ever since. 

Gajdusek’s views on microbiology were 
shaped by the training in physical chemistry 
that he received from Linus Pauling, and 
by schooling in cell biology and virology 
from John Enders — both Nobel laureates. 
From 1955 to 1957, he also worked in 
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Melbourne, Australia, with Frank Macfarlane 
Burnet, who received a Nobel prize in 1960 
for his work on the recognition of ‘self’ 

by the immune system. His experiences 

in this vibrant field taught him to expect 

the unexpected, and prepared him for his 
greatest discovery. 

In 1957, Gajdusek travelled to New 
Guinea upon hearing that Vincent Zigas, a 
district medical officer, had stumbled across 
a mysterious illness — kuru — in the Fore 
tribe of the Highlands. This turned out to 
bea neurological disease affecting women 
and children. It progressed swiftly from an 
initially unsteady gait to tremors and speech 
disorders, leading within months to complete 
incapacitation, and invariably to death. 
Gajdusek suspected from the beginning that 
the disease was caused by a form of ritual 
cannibalism in which only women and 
children participated. 

In 1961, Gajdusek convinced Clarence 
Joseph Gibbs Jr, a specialist in insect-borne 
viruses, to lead a series of experiments 
designed to establish the concept of 
‘transmissible spongiform encephalopathies’ 
(TSEs). Besides kuru, TSEs include 
neurodegenerative illnesses such as 
Creutzfeldt—Jakob disease (CJD) and ‘mad 
cow’ disease. Gajdusek and Gibbs reported 
the successful transmission of kuru to 
chimpanzees in 1966, of CJD to chimpanzees 
in 1968, and of scrapie (the sheep variant of 
TSE) to monkeys in 1972. In later years, CJD 
was categorized with Alzheimer’s disease as 
an amyloidosis (diseases characterized by 
the deposition of insoluble proteins), but 
Gajdusek and co-workers showed in 1980 
that Alzheimer’s disease, unlike CJD, was not 
transmissible. 

Gajdusek’s work revealed the existence 
ofa new kind of infectious agent, one that 
did not need a nucleic acid to replicate. Now 
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called prions, these agents are misfolded 
proteins that can induce misfolding in 

other proteins. The initiation of misfolding 
falls in the twilight zone between normal 
and abnormal protein production, and is 
still not understood. The conspicuously 
unconventional Gajdusek had thus found 

a suitably eccentric infectious agent. The 
discovery won him a Nobel prize, and earned 
him the right to add his name to the staircase 
of the world’s great microbe hunters. 

Eccentricity was the source of Gajdusek’s 
genius as a scientist, and of his notoriety 
late in life. In 1997, he was imprisoned on a 
child molestation charge involving one of the 
more than 50 Micronesian and Melanesian 
children he had adopted and brought to 
the United States. On his release in 1998 he 
moved to Europe, which he regarded as less 
puritanical than his home country. 

Throughout his life, Gajdusek was a fervent 
reader of the world’s literature and a prolific 
writer. He believed in a life of learning and in 
accurate documentation and reflection. An 
example of his acuity was his letter writing. 
When you wrote to him, you got your own 
letter back with an answer to each sentence 
scribbled between the lines. 

Around two months before his death, I had 
dinner with him at the Academic Club of the 
University of Amsterdam, about a minute's 
walk from his university lodgings. It took 
us at least 20 minutes to get there, stopping 
every minute to rest because of his failing 
heart. This cumbersome trip did not stop 
him from talking all the way about his most 
recent interest: the physical evidence in the 
brain revealing a person's reading ability and 
the development of that skill. He kept yelling 
at me, while gasping for air, that before the 
age of six a child could achieve native fluency 
in at least six, if not ten, languages, if properly 
exposed. He must have had his own youth 
in mind: Gajdusek could read at least ten 
languages. 

Gajdusek will be remembered for both his 
scientific contributions and his overwhelming 
presence. As Richard Rhodes observed in his 
book on TSEs, Deadly Feasts, Gajdusek was 
“A compulsive talker who spills ideas nonstop 
for hours — good talk, often brilliant talk and 
consummate story-telling, but more than 
some listeners can bear”. 

Jaap Goudsmit 

Jaap Goudsmit is in the Research and 
Development Department of Crucell Holland, PO 
Box 2048, Leiden, 2301 CA, the Netherlands, and 
in the Academic Medical Center of the University 
of Amsterdam. 

e-mail: j.goudsmit@crucell.com 
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Cover illustration 

In RNA silencing, one 
strand of a small duplex 
RNA (combs) enters a 
silencing complex (platter) 
that contains a catalytic 
Argonaute (pincers) to 
cleave a target RNA (cord). 
(Courtesy of M. Inudo 
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RNA SILENCING 


hen Nature published the first Insight 
on RNA interference (RNAi), in 
September 2004, it was clear that 
RNAi was going to have a broad 
impact on biology, even though only six years had 
passed since the seminal paper by Andrew Fire, 

Craig Mello and colleagues was published. 

But who would have imagined how far we would 
come in the next four years in terms of understanding 
and exploiting this fundamental system of gene 
regulation? There is now a much clearer picture of 
how the small non-coding RNAs involved in this 
type of regulation are generated, drawn from the 
static images provided by crystallographic studies, 
together with the kinetic and mechanistic details 
gleaned through biochemical assays. From large-scale 
efforts to map how gene expression is affected by 
just one class of these small RNAs, microRNAs, it is 
easy to reach the conclusion that when studying any 
biological process, researchers must consider how it 
is regulated by small RNAs. Relationships between 
small RNAs and development are also being uncovered 
almost daily. And nimble biotechnology firms have, 
with breathtaking speed, aggressively translated this 
knowledge into therapeutic candidates. 

It was no surprise that the researchers who opened 
this Pandora’s box were awarded the Nobel Prize in 
Physiology or Medicine in 2006. As Géran Hansson 
stated in his presentation speech for the award, RNAi 
“has added a new dimension to our understanding of 
life and provided new tools for medicine”. However, the 
story is far from complete even now. With advances in 
sequencing technology, for example, more classes of 
small RNA are being identified, and their functions are 
likely to continue to entice and surprise us. 

With these reviews, we hope to convey some of the 
excitement driving this rapidly evolving field forward. 
We are pleased to acknowledge the financial support of 
Alnylam Pharmaceuticals and Roche in producing this 
Insight. As always, Nature carries sole responsibility for 
editorial content and peer review. 


Angela K. Eggleston, Senior Editor 
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On the road to reading the 
RNA-interference code 


Haruhiko Siomi' & Mikiko C. Siomi"? 


The finding that sequence-specific gene silencing occurs in response to the presence of double-stranded 
RNAs has had an enormous impact on biology, uncovering an unsuspected level of regulation of gene 
expression. This process, known as RNA interference (RNAi) or RNA silencing, involves small non-coding 
RNAs, which associate with nuclease-containing regulatory complexes and then pair with complementary 
messenger RNA targets, thereby preventing the expression of these mRNAs. Remarkable progress has been 
made towards understanding the underlying mechanisms of RNAi, raising the prospect of deciphering the 
‘RNAi code’ that, like transcription factors, allows the fine-tuning and networking of complex suites of gene 
activity, thereby specifying cellular physiology and development. 


The discovery of RNA interference (RNAi)' heralded a revolution in 
RNA biology. Researchers uncovered ‘hidden’ layers of regulation of 
gene expression, in which many previously unidentified families of small 
RNAs (consisting of ~20-30 nucleotides) mediate gene silencing. A 
diverse set of gene-regulatory mechanisms were found to use key steps in 
the RNAi process, including mechanisms that silence endogenous genes 
and mechanisms that restrain the expression of parasitic and pathogenic 
invaders such as transposons and viruses” ’. 

The basic RNAi process can be divided into three steps®”. First, a long 
double-stranded RNA (dsRNA) that is expressed in, or introduced into, 
the cell (for example, as a result of the base-pairing of sense and antisense 
transcripts or the formation of stem-loop structures) is processed into 
small RNA duplexes by a ribonuclease III (RNaselIII) enzyme known 
as Dicer. Second, these duplexes are unwound, and one strand is 
preferentially loaded into a protein complex known as the RNA-induced 
silencing complex (RISC). Third, this complex effectively searches the 
transcriptome and finds potential target RNAs. The loaded single-stranded 
RNA (ssRNA), called the guide strand, then directs an endonuclease that is 
present in the RISC (sometimes called the ‘slicer’ and now known to be an 
Argonaute protein*"') to cleave messenger RNAs that contain sequence 
homologous to the ssRNA, over many rounds. In this way, the guide strand 
determines the sequence specificity of the RNAi response. 

In different organisms, the RNAi pathways comprise different 
proteins and mechanisms, but they operate by strikingly convergent 
strategies. In all organisms that have been studied, RNAi involves two 
main components: small RNAs, which determine the specificity of 
the response; and Argonaute proteins, which carry out the repression. 
Depending on both the nature of the Argonaute in the RISC and the 
degree of complementarity between the small RNA and the target 
sequence in the mRNA, the association of the RISC with target mRNAs 
has been shown to have different outcomes: it can control protein 
synthesis and mRNA stability, maintain genome integrity or produce a 
specific set of small RNAs*”. Analyses of the biogenesis of small RNAs 
and their targeting mechanisms have benefited from the advent of high- 
throughput sequencing technologies and sophisticated bioinformatics”*. 
The picture emerging from these studies is that RNAi systems in different 
organisms have been refined in many ways, and such modifications 


include built-in molecular ‘rulers’ that define the size of small RNAs, 
structures that determine which strand of a small RNA is selected, 
mechanisms that direct further rounds of small RNA amplification, or 
safeguards against off-target (unrestricted and unrelated) silencing. 
Another emerging finding in the field is that the activity of RNAi 
pathways is subject to intense regulation at various levels, from the level 
of biogenesis of small RNAs to the silencing mode of the RISC. In this 
Review, we describe the biogenesis of the guide strand of small RNAs 
and the formation and actions of the RISC, and we discuss the current 
understanding of the molecular mechanisms of RNAi in the light of recent 
insights into how silencing pathways are specified and regulated. 


Biogenesis of small RNAs 

A hallmark of RNAi is that short (~20—30 nucleotide) dsRNAs — known 
as small RNAs — are generated by the activity of RNasellI enzymes 
(either Dicer alone or Drosha and Dicer). Two main categories of small 
RNAs have been defined on the basis of their precursors. The cleavage 
of exogenous long dsRNA precursors in response to viral infection or 
after artificial introduction generates short interfering RNAs (siRNAs), 
whereas the processing of genome-encoded stem-loop structures 
generates microRNAs (miRNAs). Using high-throughput sequencing 
technology, several new classes of endogenous small RNA species have 
recently been uncovered, and these include PIWI-interacting RNAs 
(piRNAs) and endogenous siRNAs (endo-siRNAs or esiRNAs). 

A common feature of all of these small RNAs is that they are loaded 
onto Argonaute proteins to effect their targeting function (discussed 
further in the section ‘Loading and sorting of small RNAs by the RISC’). 
An overview of the generation of small RNAs is presented in Fig. 1. 


siRNA biogenesis 

Dicer (Table 1) processes long RNA duplexes and generates siRNAs. 
These small RNAs are ~21-25-nucleotide duplexes with a phosphate 
group at both 5’ ends, and hydroxyl groups and two-nucleotide overhangs 
at both 3’ ends, all hallmarks of RNaselII-mediated cleavage. The Dicer 
protein contains a PAZ domain, which binds to the 3’ end of an siRNA, 
and two RNaselIII domains, which have the catalytic activity. It functions 
as a monomer", but the RNaseIII domains associate with each other to 
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form an ‘internal dimer’ (see page 405). The distance between the PAZ 
domain and the two RNaselIII domains is the length spanned by 25 base 
pairs (bp) of RNA”. Thus, Dicer itself is a molecular ruler. 


miRNA biogenesis 

Similarly, miRNAs are short (~21-25-nucleotide) RNA molecules"*; 
however, their biogenesis differs markedly from that of siRNAs. The 
primary precursors of miRNAs (pri-miRNAs) are encoded in the 
genome, and the relevant genomic regions are mostly transcribed by RNA 
polymerase II (ref. 17). The pri-miRNAs contain stem-loop structures 


a b 
dsRNA precursors 


that harbour the miRNA in the 5’ or 3’ half of the stem. During miRNA 
production in plants, one type of RNasellI, Dicer-like protein 1 (DCL1), 
generates the miRNA-miRNA* duplex in the nucleus (miRNA* being 
the sequence in the stem-loop that pairs with the miRNA, equivalent to 
the passenger strand of siRNA duplexes; discussed later). By contrast, 
in animals, miRNAs are derived in a two-step process, in which the 
nuclear-localized RNasellII Drosha defines one end of the miRNA- 
miRNA* duplex and releases a precursor miRNA (pre-miRNA) of 
~65-70 nucleotides. The pre-miRNA hairpin is then exported to the 
cytoplasm, where Dicer completes the processing. 
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Figure 1| Small RNA production and RNA silencing. a, Natural transcripts 
that form dsRNAs and hairpin-shaped structures can be sources of 
small RNAs. These precursors are processed by an RNaselIII enzyme 
(such as Drosha or Dicer), yielding small RNA duplexes. Duplexes with 
a perfect match (left pathway) are further processed by an enzyme with 
slicer activity (an Argonaute protein) into single-stranded small RNAs. 
By contrast, small RNA duplexes with a mismatch or bulge in the centre 
(right pathway) are not substrates for the slicer and thus become single- 
stranded in a cleavage-independent manner. The identity of the protein 
that carries out this unwinding is unknown. Single-stranded small RNAs 
are then loaded onto Argonaute proteins. The particular strand that 

is selected (sense or antisense) depends on thermodynamic stability. 

The loaded Argonaute proteins are guided to target mRNAs containing 
complementary sequence, and the expression of the corresponding 
genes is silenced. The mode of action of this silencing — whether the 
mRNA is cleaved or whether translation is just repressed — largely 
depends on the degree of complementarity between the target mRNAs 
and the Argonaute-associated small RNAs. b, Some small RNAs found 
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in Caenorhabditis elegans and plants are known to be produced in an 
RNA-dependent RNA polymerase (RdRP)-dependent manner. Natural 
transcripts (often aberrant RNAs) can be substrates for this type of small 
RNA synthesis. This does not occur in organisms that lack RdRP activity, 
such as mammals and Drosophila melanogaster. Single-stranded small 
RNAs generated in this way can then be loaded onto Argonaute proteins 
and silence gene expression. ¢, The PIWI subfamily of Argonaute proteins, 
which are germline specific, are loaded with piRNAs. These complexes 
function to silence transposons. Single-stranded precursors give rise to 
piRNAs, through a mechanism called the primary processing pathway. 
The proteins required for this pathway are unknown. The silencing of 
transposons by PIWI proteins simultaneously amplifies piRNAs in germ 
cells. This pathway is known as the secondary processing pathway (or the 
ping-pong amplification loop) and is conserved in a variety of organisms, 
including mice and zebrafish. In this pathway, the slicer activity of the 
PIWI proteins reciprocally forms the 5’ ends of piRNAs by cleaving 
transposon transcripts (piRNA precursors). Proteins required to form 
the 3’ end of piRNAs remain unidentified. 
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Table 1| Key proteins in RNA silencing in various organisms 


Protein Yeast Plant Nematode Fruitfly Mammal 
(Schizosaccharomyces (Arabidopsis thaliana) (Caenorhabditis (Drosophila 
pombe) elegans) melanogaster) Mouse Human 
RNaselll Der DCL1 DCR-1 DCR-1 DICER1 DICER1 
DCL2 DRSH-1 DCR-2 DROSHA DROSHA 
DCL3 DROSHA 
DCL4 
Argonaute: AGO subfamily Agol AGO1 ALG-1 AGO1 AGO1 AGO1 
AGO2 ALG-2 AGO2 AGO2 AGO2 
AGO4 3 others AGO3 AGO3 
AGO5 AGO4 AGO4 
AGO6 AGO5 (possibly 
AGO7 (ZIPPY) a pseudogene) 
AGO10 (ZLL, PNH) 
3 others 
Argonaute: PIWI subfamily None None ERGO-1 AGO3 MILI (PIWIL2) HILI (PIWIL2) 
PRG-1 PIWI MIWI (PIWIL1) HIWI (PIWIL1) 
PRG-2 AUB MIWI2 (PIWIL4) HIWI2 (PIWIL4) 
PIWIL3 (HIWI3) 
Argonaute: WAGO subfamily None None RDE-1 None None None 
SAGO-1 
SAGO-2 
PPW-1 
PPW-2 
CSR-1 
NRDE-3 
11 others 
Double-stranded-RNA-binding None FYE PASH-1 PASHA DGCR8 DGCR8 
domain (dsRBD)-containing RDE-4 R2D2 TRBP (TARBP2) TRBP (TARBP2) 
cofactor of RNaselll 
LOQS PACT (PRKRA) PACT (PRKRA) 
RNA-dependent RNA Rdp1 RDRI1 EGO-1 None None None 
polymerase (RdRP) RDR2 (SMD1) RRF-1 
RDR6 (SDE1, SGS2) RRF-3 
3 others 1 other 


Molecules that belong to these categories but have unknown functions are not listed but are indicated as ‘others’. Common synonyms are indicated in parentheses. Data were taken from refs 8, 12, 29,50 and 98. 


Drosha is present in a large complex, known as the microprocessor 
complex, which functions like a molecular ruler to determine the cleavage 
site in the pri-miRNAs’*”. In this complex, Drosha interacts with its 
cofactor, known as DGCR8 or Pasha (depending on the species), which 
also binds to dsRNA (through its dsRNA-binding domain; dsRBD)”*”. 
A typical metazoan pri-miRNA consists of a 33-bp stem, a terminal 
loop and ssRNA flanking segments. The flanking segments are crucial 
for binding to DGCR8, and the 33-bp stem is also required for efficient 
binding. Drosha can interact transiently with the stem of this ‘pre-cleavage’ 
complex, and the processing centre of the enzyme, located at ~11 bp from 
the ssRNA-dsRNA junction, makes a staggered pair of breaks in the 
RNA to create the ~65-70-nucleotide pre-miRNA. Thus, DGCR8 might 
function as the molecular anchor that measures the distance from the 
ssRNA-dsRNA junction. It is possible that the microprocessor complex 
could recognize the terminal loop as ssRNA and bind to the stem-loop 
structure in the opposite orientation. In this case, abortive cleavage can 
occur at an alternative site ~11 bp from the terminal loop. However, most 
pri-miRNAs contain internal bulges or weakly paired bases ~11 bp from 
the terminal loop that mitigate processing from this direction”. 

Although many of the sequences encoding miRNAs are located 
within introns, clusters encoding miRNAs that are processed directly 
by the spliceosome, instead of Drosha, were recently identified””’. The 
3’ end of the stem-loop precursor of these intronic miRNAs (known as 
mirtrons) coincides with the 3’ splice site of a small annotated intron 
and is cleaved in the same splicing pathway as pre-mRNA in the nucleus 
instead of by Drosha. Subsequently, the mirtron precursors, which are 
released by the spliceosome in the shape of a lariat (lasso), are linearized 
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by a de-branching enzyme. They then enter the miRNA-processing 
pathway directly (by mimicking the structural features of pre-miRNA 
hairpins) and are therefore exported to the cytoplasm and processed by 
a Dicer protein, bypassing Drosha-mediated cleavage. 

The imprecision of Drosha or Dicer cleavage could result in the 
production of a set of miRNA-miRNA* duplexes with a variety of 
5’ and 3’ ends. Most miRNAs in animals form imperfect hybrids with 
sequences in the target mRNA, with most of the pairing specificity being 
provided by the 5’-proximal region of the miRNA (that is, positions 2-8; 
also known as the seed region)”. Imprecise cleavage either alters the 
seed sequence or inverts the relative stabilities of the 5’ and 3’ ends of 
the duplex (see the section “Loading and sorting of small RNAs by the 
RISC’). The results of recent deep-sequencing studies of small RNAs, 
however, indicate that human cells might take advantage of such 
imprecise cleavage, because the generation of a diverse set of miRNAs 
from a single precursor could be a way of broadening the network of 
factors and processes that are regulated by miRNAs”. 


RNaselll-independent pathways of small RNA biogenesis 

In some systems, small RNAs do not seem to be produced in response 
to dsRNA, but silencing signals are still amplified. Because these small 
RNAs do not arise from dsRNA precursors, RNaselII enzymes cannot 
be involved in their generation. These findings therefore call into 
question the definition of RNAi. In this subsection, we describe the 
known RNaselll-independent pathways of small RNA production, 
including those that generate piRNAs, 21U-RNAs, and secondary 
siRNAs in Caenorhabditis elegans. 
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The small RNAs known as piRNAs were named for their ability to 
bind to a group of Argonaute proteins known as PIWI proteins. As noted 
earlier, members of the Argonaute family bind directly to small guide 
RNAs and lie at the core of all known RISCs*. Argonaute proteins consist 
of a variable amino-terminal domain and three conserved domains (the 
PAZ, middle (MID) and PIWI domains)*”*”’. The 3’ end of a small RNA 
interacts with the PAZ domain, whereas the phosphate group at the 
5’ end of small RNAs binds to a cleft bridging the MID domain and 
the PIWI domain” (see page 405). The PIWI domain hasan RNaseH-like 
folded structure’ and slicer activity (although some Argonaute proteins 
seem to have no slicer activity). There are three phylogenetic groups 
of Argonaute proteins”: the AGO subfamily (or AGO clade), named 
after the founding member Arabidopsis thaliana ARGONAUTE 1 
(AGO1); the PIWI subfamily, named after D. melanogaster PIWI 
(P-element-induced wimpy testis); and the WAGO (worm-specific 
Argonaute) subfamily of C. elegans-specific proteins. PIWI-subfamily 
proteins bind to piRNAs*’”’ (Table 1). These small RNAs have 
been found only in germ cells, and they are important for germline 
development and suppress transposon activity in the germline cells of 
mammals, fish and D. melanogaster. They are ~24-31 nucleotides (sli- 
ghtly longer than miRNAs), usually have a uridine at the 5’ end and carry 
a5’ monophosphate. Unlike mammalian miRNAs, but similarly to plant 
miRNAs, piRNAs have a 2’-O-methyl (2’-O-Me) modification on the 
nucleotide at the 3’ end, a modification that is carried out by a HEN1- 
like methyltransferase** “. If Dicer is mutated, the production of piRNAs 
is not affected, indicating that their biogenesis is distinct from that of 
miRNAs and siRNAs and does not involve dsRNA precursors’. 

The sequencing of small RNAs associated with D. melanogaster PIWI- 
subfamily proteins (PIWI, Aubergine (AUB) and AGO3)**“ showed that 
piRNAs associated with AUB and PIW1 are derived mainly from the 
antisense strand of retrotransposons, whereas AGO3-associated piRNAs 
arise mainly from the sense strand. AUB- and PIWI-associated piRNAs 
show a strong preference for uridine at their 5’ ends, whereas AGO3- 
associated piRNAs showa preference for adenosine at nucleotide 10. The 
first ten nucleotides of AUB-associated piRNAs can be complementary 
to the first ten nucleotides of AGO3-associated piRNAs. In addition, 
PIWI-subfamily proteins have slicer activity that allows them to cleave 
an RNA substrate opposite position 10 of their bound piRNA*™*. These 
observations suggest that piRNAs have a self-amplifying loop (Fig. 1), 
in which sense piRNAs associated with AGO3 cleave long antisense 
transcripts and guide the formation of the 5’ end of antisense piRNAs 
bound to AUB or PIWI, and vice versa. Thus, in this amplification loop, 
which is called the ping-pong cycle”, transposons are both a source of 
piRNAs and a target of piRNA-mediated silencing. After the resultant 
cleavage products have been loaded onto another member of the PIWI 
subfamily, further (as yet unidentified) nuclease activity generates the 
3’ end of the piRNA, with the specific size of the piRNA determined by 
the footprint of the PIWI-subfamily protein on the RNA, a step that seems 
to precede 2'-O-Me modification”. In each PIWI-subfamily protein, the 
PAZ domain might be positioned at a distance from the MID domain 
that corresponds to the length of each piRNA. Thus, the PAZ domain 
might function as part of a molecular ruler for processing piRNAs ofa 
defined size. Signatures of this amplification cycle are also apparent in 
zebrafish (Danio rerio) germ cells and in mammalian germ cells before 
the pachytene stage of meiosis during spermatogenesis”. 

PIWI-subfamily proteins and, presumably, their associated piRNAs 
are loaded into embryos from the ova’, implying that the piRNAs that 
initiate an amplification cycle of piRNA biogenesis (which generates 
secondary piRNAs) could be supplied by germline transmission. But 
several findings indicate that there must be mechanisms of piRNA 
biogenesis other than amplification induced by maternal piRNAs. 
First, the amplification cycle in D. melanogaster engages mainly 
AGO3 and AUB***, but piRNAs are still loaded onto PIWI, which is 
spatially separated from these proteins at the subcellular and cell-type 
levels****“*. Second, piRNAs derived from a particular piRNA cluster 
in the genome (the flamenco locus) associate almost exclusively with 
PIWI”. These findings indicate that flamenco-derived piRNAs are 


produced by a pathway independent of the amplification loop. Whether 
such a piRNA-biogenesis pathway exists remains to be determined. 

What at first seemed to be another type of small RNA, 21U-RNA, 
is found in C. elegans. These small RNAs are precisely 21 nucleotides 
and have a bias towards uridine at the 5’ end (but not in the remaining 
20 nucleotides), and the genetic regions that encode them contain a 
characteristic sequence motif ~42 bp upstream of the first nucleotide 
of the small RNA”. It is possible that these RNAs are derived from 
thousands of separate, autonomously expressed, loci that are broadly 
scattered in two large regions of one chromosome. They are expressed 
solely in the germ line and interact with the PIWI-subfamily protein 
PRG-1 (refs 47, 48); therefore, 21U-RNAs are the C. elegans equivalent of 
piRNAs by definition. Like piRNAs, they depend on PRG-1 activity for 
their accumulation and are independent of DCR-1 (the C. elegans Dicer 
protein) for their production. C. elegans with mutations in prg-1 have 
a smaller brood and a temperature-sensitive sterile phenotype, which 
is consistent with the idea that PIWI-subfamily proteins are involved 
in germline maintenance. Like the piRNAs found in mammalian germ 
cells in pachytene****, 21U-RNAs have remarkable sequence diversity 
but lack obvious targets. 

Small RNAs witha similar role to piRNAs have also been found in the 
ciliate Tetrahymena thermophila. These scan RNAs (scnRNAs) direct 
the elimination of transposon-like DNA sequences and associate with 
a PIWI-subfamily protein, TWI1 (ref. 8) but, in contrast to piRNAs and 
21U-RNAs, are produced by a Dicer-dependent pathway”. 

These three examples (piRNAs, 21U-RNAs and scnRNAs) indicate 
that the core PlWI and piRNA machinery might have evolved to produce 
small RNAs and silence targets by different strategies. 

RNA silencing pathways include mechanisms that downregulate 
endogenous genes and restrain the expression of selfish or exogenous 
genetic material, and these pathways often share common components 
such as Dicer. Therefore, there should be competition between different 
silencing pathways for particular components. Ways to overcome such 
competition should also exist; for example, by amplifying a weak silencing 
signal. In C. elegans, distinct Argonaute proteins operate at different stages 
of RNAi, directing gene silencing in a sequential manner” — the second 
stage of which involves RNaselI]-independent biogenesis of small RNAs. 
First, a primary Argonaute protein (such as RDE-1 for exogenous siRNAs 
(exo-siRNAs) and ERGO-1 for endo-siRNAs) is guided by ‘primary’ 
siRNAs (that is, a first round of siRNAs), which have been generated 
from long dsRNAs by DCR-1. Second, the silencing signal is amplified 
by the production of ‘secondary’ siRNAs by the action of RNA-dependent 
RNA polymerases (RdRPs) (Fig. 1). These secondary siRNAs then bind 
differentially to secondary Argonaute proteins (SAGOs, members of 
the WAGO subfamily), which mediate downstream silencing. In plants, 
RNAs with aberrant features, including lack of a poly(A) tail and lack of 
a5’ cap, are copied into double-stranded forms by RdRPs and become 
substrates for Dicer, which converts them into siRNA duplexes”. By 
contrast, the C. elegans somatic RdRP mostly produces 21-nucleotide, 
single-stranded, 5’-triphosphorylated small RNAs directly from the 
target mRNA in a primer-independent manner without the need for 
Dicer-mediated cleavage of dsRNA***. Such recruitment of an RRP 
directly to the target mRNA allows dsRNA synthesis without consuming 
the siRNAs generated in response to the original trigger, although it is 
unclear how the 3’ end of these secondary siRNAs is formed and what 
the molecular ruler is that determines their size. 


Blurring of the boundaries between small RNA types 

As described above, the three main classes of small RNA — siRNAs, 
miRNAs and piRNAs — are distinct in their biogenesis and cellular 
roles. However, recent findings blur these distinctions and show that 
there are even more-complex interactions between factors involved in 
small RNA biogenesis. Deep sequencing of small RNAs from somatic 
tissues and cultured somatic cells in D. melanogaster has uncovered 
another class of small RNA, consisting of 3’-methylated, 21-nucleotide 
RNAs derived from the D. melanogaster genome. These endogenous 
RNAs are derived from transposons and from several loci, including 
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loci that encode cis-natural antisense transcript pairs, and long stem- 
loop structures containing many mismatched pairs in their stems”. 
In D. melanogaster, distinct Dicer-containing complexes produce 
exo-siRNAs and miRNAs*””. DCR-1 generates miRNAs, acting with 
its dsRNA-binding protein partner, Loquacious (LOQS)°™, and the 
miRNAs are loaded onto AGO1. By contrast, DCR-2, together with 
its dsRNA-binding protein partner, R2D2 (ref. 62), generates exo- 
siRNAs, which are loaded onto AGO2. Like exo-siRNAs, the recently 
discovered endogenous small RNAs are produced by the DCR-2- 
dependent pathway and are loaded onto AGO2, and they are therefore 
called endo-siRNAs. However, the generation of many endo-siRNAs 
requires LOQS***, the dsRBD-containing partner of DCR-1 in the 
miRNA pathway*”’, but not R2D2, the partner of DCR-2 (ref. 62). 
In D. melanogaster deficient in DCR-2 or AGO2, the expression of 
transposons increases, so endo-siRNAs might be the main mechanism 
for silencing ‘selfish’ genetic elements in somatic cells, which lack 
the piRNA pathway. Therefore, endo-siRNAs and piRNAs are 
fundamentally similar in that they defend organisms against nucleic- 
acid-based ‘parasites. This finding also shows that D. melanogaster 
has two RNAi pathways that repress transposon expression. Mouse 
oocytes have also been shown to contain endo-siRNAs. These RNAs 
are derived from various sources, including transposons*™; however, 
some are processed from overlapping regions of functional genes and 
their cognate pseudogenes. This finding suggests that pseudogenes, 
which have been thought to be non-functional protein ‘fossils, might 
regulate the expression of their founder genes. 

Although siRNAs and miRNAs are categorized in terms of their origin 
rather than their size or function”, the discovery of endo-siRNAs makes 
it difficult to distinguish between siRNAs and miRNAs. This blurring of 
the boundaries between the different types of small RNA has interesting 
evolutionary implications. The long stem-loop structures that are 
processed to form endo-siRNAs are reminiscent of the pre-miRNAs in 
plants. One hypothesis for the evolutionary origin of plant miRNAs is 
that new plant miRNA loci might evolve from the inverted duplication 
of founder loci, which when transcribed would result in hairpin RNAs”. 
These hairpin RNAs would have almost perfect self-complementarity 
and might be processed by Dicer-like enzymes other than DCL1, the 
main miRNA-processing enzyme in plants, because DCL1 has limited 
activity against such substrates. Subsequent acquisition of mutations 
as a result of genetic drift would produce a hairpin with imperfect 
complementarity, which could then be processed by DCLI1. Thus, the 
stem-loop structures from which endo-siRNAs are derived could be 
evolutionary intermediates that are gradually transformed into miRNA 
precursors. It is possible that such an adaptive switch could also occur 
during the evolution of miRNA-encoding genes in D. melanogaster, in 
which DCR-1 would then generate miRNAs instead of endo-siRNAs 
being generated by DCR-2. 


Loading and sorting of small RNAs by the RISC 

In gene silencing pathways initiated by dsRNA precursors, Dicer-medi- 
ated cleavage yields small dsRNA intermediates (small RNA duplexes). 
These small RNA duplexes must be dissociated into ‘competent’ single 
strands in order to function as guides for RISCs. For each small RNA 
duplex, only one strand, the guide strand, is loaded onto a specific Argo- 
naute protein and assembled into the active RISC; the other strand, the 
passenger strand, is destroyed. Many eukaryotes express more than 
one Argonaute protein, and these proteins bind to small RNAs ina 
sequence-independent manner. So how are small RNAs sorted and 
loaded onto a specific Argonaute protein? 


Loading 

A small RNA generated from dsRNA precursors is converted from a 
duplex into a single-stranded form as it is loaded into the RISC. The key 
steps in converting the RISC from its precursor form (the pre-RISC), 
which contains the small RNA duplex, to its mature form (the holo- 
RISC), which contains the guide strand, are small RNA strand unwinding 
and preferential strand selection. The prevalent view of RISC loading is 
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that thermodynamic asymmetry along small RNA duplex determines 
which RNA strand is retained and which is discarded. More specifically, 
the strand that has its 5’ end at the thermodynamically less stable end of 
the small RNA duplex is preferentially loaded into the RISC as the guide 
strand, a phenomenon referred to as the asymmetry rule™. 

For siRNAs, the known interactions between Dicer and the Argonaute 
proteins® indicate that the production of the small RNA and the 
assembly of the RISC might be physically coupled. For example, in 
D. melanogaster, DCR-2 does not simply transfer siRNAs to a distinct 
RISC but, instead, forms part of the RISC together with the siRNAs, 
indicating that the role of DCR-2 extends beyond the initiation phase. 
The loading of siRNA duplexes onto AGO2 is facilitated by the RISC- 
loading complex, which contains DCR-2 and its dsRBD-containing 
partner, R2D2 (refs 62, 67). The particular strand of the siRNA duplex 
that is loaded onto AGO2 seems to be determined by the orientation 
of the DCR-2-R2D2 heterodimer on the siRNA duplex®. R2D2 is 
thought to sense the thermodynamic stability of the siRNA duplexes and 
bind to the more stable end of the siRNA, whereas DCR-2 is recruited to 
the less stable end. The heterodimer probably recruits AGO2 through 
an interaction between DCR-2 and AGO2. Previous models have 
proposed that the transition from a double-stranded silencing trigger 
to a single-stranded one is mediated by an unidentified ATP-dependent 
RNA helicase. However, the unwinding of the siRNA duplex and the 
loading ofa single strand into the RISC are facilitated by the slicing of 
the unincorporated (passenger) strand by AGO2, a process that does 
not require ATP” (Fig. 1). Cleavage in the middle of the passenger 
strand, as though the passenger strand were an mRNA target, would be 
expected to reduce the annealing temperature and the free energy of 
duplex formation, which in turn facilitates the separation of the siRNA 
strands. These data support a model in which siRNAs are initially loaded 
as duplexes onto an AGO2-containing pre-RISC (Fig. 2). 

By contrast, in humans, pre-miRNAs are known to bind to a 
preformed trimeric complex of AGO2, DICER] and DICER1’s dsRBD- 
containing partner, TRBP”. This complex can cleave target RNAs using 
pre-miRNA and can distinguish miRNA from miRNA”, in the absence 
of ATP hydrolysis’””’, suggesting that DICER1-mediated cleavage 
and sensing of thermodynamic stability occur in series in the AGO2- 
DICER1-TRBP complex. 

This process by which a pre-RISC is converted to a holo-RISC can also 
occur by a slicer-independent mechanism. Three of the four Argonaute 
proteins in humans (AGO1, AGO3 and AGO4) lack slicer activity but are 
nonetheless loaded with single-stranded guide siRNAs”™"”*. Similarly, 
single-stranded miRNAs are found associated with AGO2 in humans, 
despite the expectation that mismatches in the unwound pre-miRNA 
should block the passenger-strand cleavage activity of AGO2. Thus, 
a cleavage-independent (bypass) mechanism for RISC assembly must 
exist. RNA helicase A has been identified as a candidate for unwinding 
the duplex in this process”*, 


Sorting 

Once assembled, RISCs mediate a range of the effector steps in all RNA 
silencing mechanisms, from repressing translation to maintaining 
genome stability. The specialized functions of RISCs are likely to 
result from the particular proteins that associate with each Argonaute 
protein. In other words, the different RISC variants are distinguished 
by their constituent Argonaute protein. Thus, it is crucial that a specific 
set of small guide RNAs is directed to a specific Argonaute protein. 
Analyses of how different types of small RNA are channelled to different 
Argonaute proteins show that there are multiple mechanisms: the 
determinants for small RNA sorting vary from the structure of the small 
RNA duplex to the identity of the 5’ nucleotide and the presence and 
extent of modifications to this nucleotide. 

In D. melanogaster, pre-miRNAs are processed by DCR-1, whereas 
exo-siRNA duplexes are produced by DCR-2 from long dsRNAs™* 
(Fig. 2a). Small RNAs then seem to be loaded onto either AGO1 or 
AGO2, depending on the structure ofa small intermediate RNA duplex”. 
If the duplex has a bulge in the middle (frequently observed in miRNA 
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precursors), the small RNA is routed to AGO1. If the duplex is perfectly 
matched, the small RNA is channelled to AGO2. This is because the 
DCR-2-R2D2 heterodimer, which recruits AGO2 to form the pre-RISC, 
binds well to highly paired small RNA duplexes but poorly to duplexes 
with central mismatches. Thus, the DCR-2-R2D2 heterodimer not only 
determines the polarity of siRNA loading on the basis of thermodynamic 
stability rules but also functions as a gatekeeper for AGO2-containing 
RISC assembly, promoting the incorporation of siRNAs over miRNAs. 
These observations suggest that each siRNA duplex dissociates from the 
active site of the Dicer protein after it is produced and is subsequently 
recaptured by the DCR-2-R2D2 heterodimer. However, although AGO1 
favours binding to small RNA duplexes with central mismatches, a large 
proportion of miRNA-miRNA* duplexes with a base-paired central 
region still enter into AGO1-containing RISCs”, suggesting that the 
AGO1-loading pathway is selective and not a default pathway for small 
RNAs rejected by the AGO2 pathway. 

The identity of the nucleotide at the 5’ end and the extent to which 
this nucleotide is phosphorylated also influence which Argonaute 


Figure 2 | Sorting of small RNAs onto distinct Argonaute 
proteins. Small RNAs are sorted onto specific Argonaute 
proteins, and this process occurs by several mechanisms. a, In 
Drosophila melanogaster, small RNAs originating from a duplex 
are loaded onto one of two Argonaute proteins (AGO1 or 
AGO2), on the basis of the structure of the small RNA duplex. If 
the duplex has a mismatch or a bulge in the centre (as miRNAs 
do), then the RNA is routed to AGO1. If the duplex is perfectly 
matched (as siRNAs are), then the small RNA is routed to AGO2. 
This selectivity occurs because the small RNAs are loaded onto 
Argonaute proteins from a Dicer-containing complex, and the 
two forms of Dicer, DCR-1 and DCR-2, associate with different 
RNA structures. DCR-2 pairs with R2D2, and this heterodimer 
binds to highly paired small RNA duplexes but recognizes small 
RNA duplexes with a central mismatch only poorly. AGO2 
favours binding to DCR-2-R2D2 over binding to the other Dicer- 
containing complex, DCR-1-LOQS, which binds to small RNAs 
with bulges. Further processing into single-stranded 

small RNAs is described in Fig. 1. b, Arabidopsis thaliana 
miRNAs and trans-acting siRNAs (ta-siRNAs) have a 5’ uridine 
and preferentially associate with AGO1. By contrast, AGO2 and 
AGOS show preferences for small RNAs containing 5’ adenosines 
and 5’ cytidines, respectively. However, it is unlikely that the 

5’ nucleotide is the sole determinant of selective loading in 

A. thaliana. c, Secondary endo-siRNAs in Caenorhabditis 
elegans, as well in Schizosaccharomyces pombe, have a striking 
strand bias in which only the antisense siRNA is loaded onto 
Argonaute proteins. These siRNAs correspond to the RNA strand 
synthesized by RdRP. In C. elegans, RdRP produces small RNAs 
directly from the target mRNA in a primer-independent manner. 
Thus, these secondary small RNAs show negative polarity, and 
this mechanism reinforces the silencing carried out by 

the primary small RNAs. 


protein the small RNA associates with. In contrast to what is observed. 
in D. melanogaster, processing by Dicer may be uncoupled from 
association with Argonaute proteins in A. thaliana because, in this 
species, the miRNAs are all generated by one particular Dicer protein, 
DCLI1, but are still sorted and loaded onto different Argonaute proteins. 
In A. thaliana, miRNAs and trans-acting siRNAs (ta-siRNAs), a class 
of small RNAs that regulate plant development”’, generally have a 
5‘ uridine and preferentially associate with AGO] (ref. 76) (Fig. 2b). By 
contrast, AGO2 associates preferentially with small RNAs containing 
5’ adenosines, and AGOS5 prefers 5’ cytidines. Interestingly, if the 
opposite strand of a miRNA (that is, miRNA”) has a 5’ adenosine or a 
5’ cytidine, it is bound to AGO2 or AGOS, respectively. These findings 
have led to the hypothesis that the binding affinity of Argonaute proteins 
for small RNAs is determined by the nucleotide at the 5’ end. Although 
these 5’-nucleotide preferences generally hold true for these Argonaute 
proteins in plants, exceptions have been observed: the A. thaliana 
miRNA known as miR-172 has a 5’ adenosine but preferentially 
associates with AGO] (ref. 77); and AGO7 preferentially associates with 
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miR-390, which has a 5’ adenosine”. Therefore, the 5’ nucleotide does 
not seem to be the sole determinant of Argonaute association. 

Another mechanism might operate for secondary siRNAs in C. elegans. 
These small RNAsare specifically loaded onto SAGOs”. Secondary siRNAs 
carry a 5’-triphosphate modification™””, the hallmark of RdRP products, 
which might function as a recognition element for SAGO binding while 
excluding binding by a primary Argonaute, such as RDE-1. 

Endo-siRNAs in C. elegans (including the secondary siRNAs just 
mentioned) and Schizosaccharomyces pombe (fission yeast) have 
a striking strand bias in which only the antisense siRNA strand, 
corresponding to the RNA strand synthesized by RdRP, is loaded into 
Argonaute-containing complexes. Because C. elegans RdRPs produce 
small RNAs directly from the target mRNA, in a primer-independent 
manner (Fig. 2c), all secondary siRNAs have a negative polarity and 
function to reinforce the silencing of the target mRNA”. In S. pombe, 
the strand bias is probably the result of a different mechanism. The 
physical association of Dicer with an RdRP-containing complex known 
as RDRC and an Argonaute-containing complex known as the RNA- 
induced transcriptional silencing complex (RITS) (see page 413) may 
facilitate the loading of siRNAs onto Argonaute proteins in a directional 
manner as Dicer moves along and cleaves the dsRNA products of RdRP, 
giving rise to an antisense strand bias. This suggests that the polarity of 
Dicer processing defines the polarity of the siRNA strand loaded onto 
the Argonaute protein. 

Argonaute proteins have diversified over evolutionary timescales, 
evolving a range of functions*””. These findings about small RNA sorting 
imply that the diversification of the Argonaute proteins is a consequence 
of which small RNA they recruit. It is possible that the conformation of 
the Argonaute protein dictates which small RNAs it partners, but the 
structures of eukaryotic Argonaute proteins will need to be determined 
before this can be assessed. 


Safeguards in silencing pathways 

During RNA silencing, a single non-sequence-specific RNA-binding 
protein (Argonaute) is loaded with small guide RNAs with a variety of 
sequences, resulting in effector complexes (RISCs). Thus, this system 
requires gatekeepers to ensure that Argonaute can bind to small guide 
RNAs but not to degraded small RNAs, thereby avoiding ‘off-target’ 
silencing. Such gatekeeper systems seem to depend mainly on structural 
features specific for small guide RNAs. 

As described earlier, Dicer helps to load siRNAs into the RISC, 
preventing siRNAs from diffusing freely in the cytoplasm after 
their production. This function of Dicer probably also aids in the 
discrimination of genuine siRNAs from various RNA-degradation 
products in the cell. Processing by RNaselII enzymes (such as 
Dicer) characteristically yields small RNAs with 5’ monophosphates 
and 3’ two-nucleotide overhangs. The PAZ domain of Argonaute 
proteins might, as a first step, distinguish degraded RNAs (derived 
from unrelated pathways) from these small RNAs by binding to the 
characteristic 3’ overhangs of the small RNAs*”. In addition, to become 
incorporated into the RISC and mediate cleavage of the target mRNA, 
the guide strand of an siRNA must have a phosphate group at the 
5’ end’. In humans, the 5’ end of siRNAs is phosphorylated by the 
enzyme CLP1 (ref. 79), which also has roles in splicing transfer RNAs 
and forming the 3’ ends of mRNAs. Interestingly, both tRNA splicing 
and mRNA 3’-end formation occur in the nucleus", suggesting that 
siRNA duplexes with a 5’ hydroxyl group are transported to, or diffuse 
into, the nucleus and, after phosphorylation by CLP1, are exported to 
the cytoplasm and assembled into the RISC. 

Amplification of the silencing signal needs to be balanced against 
the dangers of amplifying off-target silencing. For example, the slicer- 
mediated ping-pong mechanism for piRNA production does not 
lead to ‘transitive’ RNA silencing (in which RdRPs synthesize siRNAs 
complementary to sequences upstream or downstream of the initial 
trigger region in the target mRNA). Instead, it leads to conservative 
amplification of functional primary piRNA sequences (those inherited 
by germline transmission). However, it is conceivable that any off-target 
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events mediated by RdRPs could lead to a chain reaction or transitive 
effect of silencing with deleterious consequences. Thus, there must be 
safeguards to prevent the pervasive use of RdRPs. A striking aspect of 
RdRP-based trigger amplification is that amplification occurs only 
when a target has been engaged, so amplification of the silencing signal 
is limited to cases in which there is a real target*””’. In C. elegans, the 
processing of the trigger dsRNA and the loading of primary siRNAs 
into the RDE-1-containing complex seem to be inherently inefficient, 
limiting the first round of target recognition by RDE-1-containing 
complexes and minimizing the risk of amplifying off-target silencing 
reactions”. In addition, each secondary siRNA seems to be generated 
by non-processive self-termination by RdRP, thereby restricting 
transitive effects*!>*. Furthermore, secondary siRNAs associate with 
SAGOs, which lack catalytic residues for cleaving mRNAs, suggesting 
that these complexes cannot generate cleaved substrates for further 
amplification, which in turn would prevent them from inducing the 
exponential generation of secondary siRNAs” (but see also ref. 53 for 
a conflicting viewpoint). SAGOs are also present in limited supply 
and thus have a restricted capacity to support multiple simultaneous 
silencing reactions. 

Another factor is that in C. elegans and S. pombe the RNAi machinery 
is negatively regulated by a conserved siRNA nuclease called enhanced 
RNAi (ERI-1 and Eril, respectively)’. In S. pombe, transgene silencing 
is linked to a protein complex resembling the TRAMP complex of 
Saccharomyces cerevisiae (budding yeast), which carries out surveillance 
in the nucleus, targeting aberrant transcripts for degradation by the 
exosome™. Thus, RNAi in S. pombe is actively restricted from exerting 
its effects throughout the genome and seems to be subject to competition 
from RNA quality-control machinery. 


Target-sensing modes and effector modes of the RISC 

When the RISC is loaded with the guide strand of a small RNA, how 
does it find its target mRNA? Most of the binding energy that tethers 
a RISC to a target mRNA is from nucleotides in the seed region of the 
small RNA. It seems that the accessibility of the target site can be 
sensed by the intrinsic, nonspecific affinity of RISC for ssRNA, which 
follows the initial specific association between the RISC and the target 
(through the 5’ seed region of the small RNA)**. But the accessibility 
of the target site correlates directly with the efficiency of cleavage, 
indicating that the RISC cannot unfold structured RNA. 

Target mRNAs are present in the cell in complex with ribonucleo- 
proteins (RNPs)”, so target accessibility is also controlled by several 
RNA-binding proteins that either mask the target binding site or 
facilitate unfolding of the target. Therefore, the function of a RISC seems 
to be context-dependent, with its effector mode influenced not only by 
the structures of the small-RNA-binding sites on the target but also by the 
particular proteins associated with each Argonaute protein. For example, 
animal miRNAs silence gene expression by at least three independent 
mechanisms through binding sites that are mostly in the 3’ untranslated 
region of target mRNAs: by cleaving mRNAs, by repressing their 
translation and/or by promoting mRNA degradation**”. However, the 
contribution of translational repression or mRNA degradation to gene 
silencing seems to differ for each miRNA-mRNA pair. Thus, the final 
outcome of miRNA regulation is probably affected by other proteins 
interacting with the targeted mRNA or RISC and counteracting the 
effects of the miRNA, resulting in differential regulation depending on 
the proteins present in each tissue”. 


Regulation of silencing pathways 

So far, the pictures of RNA silencing pathways that we have built up 
(shown in Figs 1 and 2) are static. To gain further insight into silencing 
processes, it is important to incorporate information about how these 
pathways are regulated. It is already clear that competition between 
different silencing pathways (for example, competition between endo- 
siRNAs and miRNAs for LOQS in D. melanogaster) is a key step in 
how each stage of the RNAi mechanism is regulated. Many plant and 
animal viruses are known to encode suppressor proteins that block 
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host RNAi, and therefore silencing, at various stages”. Cellular proteins 
can also regulate RNAi. For example, processing to form the human 
miRNA let-7, which is a tumour suppressor and cell-cycle regulator, is 
post-transcriptionally inhibited in embryonic cells by the pluripotency 
factor LIN28, which seems to block the microprocessor-complex- 
mediated cleavage of pri-let-7 and the Dicer-mediated processing of 


pre-let-7 in series”*”*. By contrast, in humans, signalling mediated by 


the transforming growth factor-§ (TGF-B) and bone morphogenetic 
protein (BMP) family of growth factors rapidly increases the production 
of mature miR-21 (which is oncogenic), by promoting the processing of 
pri-miR-21 into pre-miR-21 by DROSHA™. More specifically, TGF-B- 
and BMP-specific signal transducers of the SMAD family are recruited 
to pri-miR-21 in complex with the RNA helicase p68, a component of the 
microprocessor complex, facilitating the accumulation of pre-miRNA. 
In addition, heterogeneous nuclear RNP Al (hnRNP A1), a well-known 
regulator of precursor mRNA splicing, also assists DROSHA to crop 
and release pre-miR-18 efficiently, perhaps by refolding the hairpin 
or by creating a cleavage site for DROSHA through direct binding to 
the pri-miRNA”. This implies that some hairpins within pri-miRNAs 
might form and be processed only after the binding of a protein with 
RNA chaperone activity. 

The activity of the RISC can also be regulated. In A. thaliana, 
the non-protein-coding gene IPS1 (INDUCED BY PHOSPHATE 
STARVATION 1) contains a motif with sequence complementarity to 
the phosphate-starvation-induced miRNA mik-399, but the pairing is 
interrupted by a mismatched loop at the expected miRNA cleavage site”. 
IPSI mRNA is not cleaved but, instead, sequesters miR-399. Thus, IPS1 
overexpression results in increased accumulation of the target of miR-399, 
PHO2 mRNA. The idea of target mimicry introduces unanticipated 
complexity into the network of RNA-regulatory interactions and raises 
the possibility that a large number of mRNA-like non-coding RNAs 
recently identified in humans” could be attenuators of the regulation 
of small-RNA-Argonaute complexes. 


Perspective 

Recent studies hint that human cells contain a large number of small 
RNAs similar to miRNAs or siRNAs, with the potential to regulate the 
expression of almost all human genes. The future challenges in this 
field are clear. Many questions remain to be answered. How many types 
of small RNA are there? How are these small RNAs generated? What 
are their biological functions? How are these pathways regulated? One 
potential problem is that because many types of small RNA are modified 
at their 5’ and 3’ ends”, it is unclear whether the current sequencing 
technologies are sampling the entire range of small RNAs present in 
cells. But next-generation sequencing technologies” should soon help 
to uncover the full range of small RNA molecules. 

One major challenge will be to identify how specific RNA-binding 
proteins affect the final outcome of gene regulation by small RNAs, 
given that RNAs in a cell are usually associated with multiple proteins 
that regulate many aspects of gene expression. For example, genome- 
wide in vivo approaches using a combination of immunoprecipitation 
and high-throughput sequencing will be required to establish protein- 
mRNA interactions or RNP complex occupancy at certain regions of 
mRNA, where expression is suppressed. 

Finally, changes in the activity and specificity of silencing pathways could 
create quantitative and qualitative genetic variation in gene expression, 
thereby generating new gene-expression networks. Such changes might 
have contributed to many processes, including human evolution’®. Given 
that all vertebrates have almost exactly the same number of protein-coding 
genes and therefore cannot readily be distinguished in this way, it might 
be prophetic that the first small guide RNA to be identified, the C. elegans 
miRNA lin-4, has been found to regulate a gene involved in the timing of 
development”. In humans, unlike other mammals, the brain tissue of 
newborns continues to grow at a similar rate to that of the fetus. This is 
a good example of a change in developmental timing, and there is much 
speculation about whether changes in this rate contributed to the evolution 
of humans as a new species. a 
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A three-dimensional view of the molecular 


machinery of RNA interference 


Martin Jinek' & Jennifer A. Doudna’*+ 


In eukaryotes, small non-coding RNAs regulate gene expression, helping to control cellular metabolism, 
growth and differentiation, to maintain genome integrity, and to combat viruses and mobile genetic elements. 
These pathways involve two specialized ribonucleases that control the production and function of small 
regulatory RNAs. The enzyme Dicer cleaves double-stranded RNA precursors, generating short interfering 
RNAs and microRNAs in the cytoplasm. These small RNAs are transferred to Argonaute proteins, which 
guide the sequence-specific silencing of messenger RNAs that contain complementary sequences by 

either enzymatically cleaving the mRNA or repressing its translation. The molecular structures of Dicer and 
the Argonaute proteins, free and bound to small RNAs, have offered exciting insights into the molecular 


mechanisms that are central to RNA silencing pathways. 


The discovery that RNA interference (RNAi) and related small-RNA- 
mediated pathways have central roles in the silencing of gene expression 
in eukaryotic cells has profoundly altered the understanding of gene 
regulation. At least 30% of human genes are thought to be regulated 
by microRNAs, one of the classes of small RNA". In a wide range of 
organisms, including petunias, nematodes, fruitflies, zebrafish and 
mice, mutations in the genetic regions encoding the protein and/or 
RNA components of RNAi result in severe and sometimes lethal defects 
in cell growth and development”. These findings have raised many ques- 
tions about how and why this widespread RNA-mediated regulation of 
genes evolved and how these mechanisms enable gene expression to be 
precisely tuned in response to internal and external stimuli. 

RNAi and related gene-silencing pathways are initiated by the pro- 
duction of small RNAs (~20-30 nucleotides) with sequences that are 
complementary to portions of the transcripts that they regulate. There 
are three main classes of small regulatory RNA: short interfering RNAs 
(siRNAs), microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs) 
(see page 396 for a review of the currently recognized classes of small 
regulatory RNA). The biogenesis and mechanism of action of the main 
types of small RNA are described in Box 1. In brief, siRNAs and miRNAs 
are generated from double-stranded RNA (dsRNA) precursors that are 
produced in, or introduced into, cells, and their generation depends on 
the ribonuclease (RNase) Dicer’. These small RNAs subsequently asso- 
ciate with members of the Argonaute family of proteins, which function 
as the core components of a diverse set of protein-RNA complexes called 
RNA-induced silencing complexes (RISCs)*. RISCs use the small RNAs 
as guides for the sequence-specific silencing of messenger RNAs that 
contain complementary sequence through inducing the degradation of 
the mRNAs or repressing their translation. In addition, in certain organ- 
isms, a specialized nuclear Argonaute-containing complex, known as 
the RNA-induced transcriptional silencing complex (RITS), mediates 
transcriptional gene silencing by inducing heterochromatin formation? 
(see page 413). The biogenesis of the piRNA class of small RNAs also 
involves proteins belonging to the Argonaute family but differs markedly 
from that of siRNAs and miRNAs’ (Box 1). 


Biochemical and structural biology studies have provided funda- 
mental insights into the molecular details of RNAi and its possible 
evolutionary underpinnings. Structural aspects of how viruses inhibit 
host-cell RNAi pathways are described in refs 7 and 8. In this Review, 
we focus on two specialized RNases: Dicer, which functions as a molec- 
ular ruler in siRNA and miRNA biogenesis; and Argonaute, a versatile 
RNA-guided molecular machine that cleaves, or otherwise represses, 
target RNAs. We highlight the ways in which recently obtained molecu- 
lar structures of these two proteins underlie the current mechanistic 
understanding of RNA silencing. 


Structural aspects of small RNA biogenesis 

The production of siRNAs and miRNAs relies on the endonucleolytic 
processing of dsRNA precursors. In a cell, long dsRNAs can arise from 
the replication of RNA viruses, from the transcription of convergent 
cellular genes or mobile genetic elements, and from self-annealing cel- 
lular transcripts. Dicer, an endonuclease belonging to the RNasellII 
family, processes these long dsRNA molecules to yield siRNA duplexes 
of ~21-25 nucleotides’. By contrast, miRNAs are generated from 
endogenous transcripts (known as primary miRNAs, pri-miRNAs) 
that form stem-loop structures (Box 1). The hairpin region is excised 
from the pri-miRNA in the nucleus by the endonuclease Drosha’, 
another RNasellI-family enzyme. After its export to the cytoplasm, 
the hairpin (known as a precursor miRNA, pre-miRNA) undergoes 
another endonucleolytic cleavage, which is catalysed by Dicer, gen- 
erating a miRNA-miRNA* duplex (where miRNA is the antisense, 
or guide, strand and miRNA* is the sense, or passenger, strand) of 
~21-25 nucleotides. 

Two features of siRNAs and miRNAs ensure that they are efficiently 
incorporated into the RISC: the length of the duplex; and characteristic 
5’ and 3’ ends, carrying a monophosphate group and a dinucleotide 
overhang, respectively” ”. Recent structural studies of a prokaryotic 
RNasellI and a eukaryotic Dicer have provided insights into how these 
crucial features of siRNAs and miRNAs are generated during their 
biogenesis by Drosha and/or Dicer. 
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Box 1| Biogenesis and mechanism of action of the main classes of small regulatory RNA 
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Small regulatory RNAs are non-coding RNA molecules that silence target 
RNAs in a sequence-specific manner. The main classes of small RNA 


are short interfering RNAs 


(siRNAs), microRNAs (miRNAs) and PIWI- 


interacting RNAs (piRNAs). 

The first class, siRNAs, mediate RNAi by downregulating target RNAs 
through slicing; that is, by endonucleolytic cleavage (see figure, panel a; 
note that RNAs are not drawn to the same scale in each panel, and m’G 
denotes 7-methylguanosine). These small RNAs are derived from long 
double-stranded RNA (dsRNA) molecules that result from RNA virus 


replication, convergent tra 


nscription of cellular genes or mobile genetic 


elements, self-annealing transcripts or experimental transfection. The 


endonuclease Dicer functi 
~21-25-nucleotide interva 


ons as amolecular ruler to cleave the dsRNA at 
s. After Dicer-mediated cleavage, one strand of 


the siRNA duplex (the guide strand) is loaded onto an Argonaute protein 


at the core of an RNA-indu 


ced silencing complex (RISC). (For simplicity, 


in the figure, the RISC is represented just by an Argonaute protein.) 


Argonaute loading takes p 
complex that consists of a 
protein (known as TRBP in 


lace in the RISC-loading complex, a ternary 
n Argonaute protein, Dicer and a dsRNA-binding 
humans). During loading, the non-guide 


(passenger) strand is cleaved by an Argonaute protein and ejected. The 


Argonaute protein then us 


es the guide siRNA to associate with target 


RNAs that contain perfect 


y complementary sequence and then catalyses 


the slicing of these targets. After slicing, the cleaved target RNA is 
released, and the RISC is recycled for another round of slicing. 

The second class, miRNAs, are encoded in the genome. Whereas 
plant miRNAs direct the slicing of target messenger RNAs, much like 


siRNAs, animal miRNAs si 


lence target mRNAs without slicing (panel b). 


These small RNAs are transcribed from endogenous miRNA genes 
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as primary transcripts (pri-miRNAs), containing ~65-70-nucleotide 
stem-loop structures. The hairpin structure is excised in the nucleus by 
the Drosha-DGCR8 complex to yield a precursor miRNA (pre-miRNA). 
Inthe cytoplasm, Dicer cleaves the pre-miRNA, producing amiRNA- 
miRNA* duplex (where miRNA denotes the guide strand and miRNA* 
the passenger strand). The guide strand is loaded onto an Argonaute 
protein. Typically, animal miRNAs are only partly complementary to 
sequences in the 3’ untranslated regions of their target mRNAs, and this 
ack of complementarity prevents the target from being sliced by the 
Argonaute protein. In addition, some Argonaute proteins involved in the 
miRNA pathway lack the catalytic residues needed for slicer activity. The 
mechanism of miRNA-mediated silencing is unresolved but is thought 
to occur by repression of target mRNA translation and removal of mRNA 
poly(A) tails (that is, deadenylation), which leads to mRNA degradation. 

The third class, piRNAs, are ~24-31 nucleotides, and they silence 
transposons (mobile genetic elements) in animal germ cells. The 
biogenesis and mechanism of action of piRNAs is poorly understood. A 
model for these processes in Drosophila melanogaster is shown in panel e. 
The precursors of piRNAs are single-stranded RNAs, because piRNA 
biogenesis does not require Dicer. The small RNAs induce reciprocal slicer- 
dependent cleavages of sense and antisense transposon transcripts. This 
process is mediated by the PIWI clade of the Argonaute family, which 
includes the proteins PIWI, Aubergine (AUB) and Argonaute 3 (AGO3) 
in D. melanogaster. PIWI- or AUB-mediated slicing of sense transcripts 
generates sense piRNAs, which associate with AGO3 and direct the 
slicing of antisense transposon transcripts. The slicing products give rise 
to antisense piRNAs, which in turn bind to PIWI and AUB and guide the 
slicing of sense transposon transcripts to generate sense piRNAs. 
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The RNaselll family 

Cleavage of dsRNA by enzymes of the RNasellI family, including Drosha 
and Dicer, yields products with characteristic termini, with a mono- 
phosphate group at the 5’ ends, and a two-nucleotide overhang at the 
3’ ends’*. The simplest RNaselII enzymes are found in prokaryotes and 
certain fungi. These enzymes consist of an RNaselII domain, which has 
the catalytic activity, and (generally) a dsRNA-binding domain (dsRBD) 
(Fig. 1a), and they function as homodimers". The two RNaselII domains 
of the dimer associate to form a single processing centre, with each cata- 
lytic domain responsible for the hydrolysis of one strand in the duplex”. 
By contrast, both Drosha and Dicer are monomeric and contain two 
tandemly arranged RNasellI domains and a single dsRBD. 

From the crystal structure of RNasellII from the prokaryote Aquifex 
aeolicus in complex with a cleaved dsRNA product”® (Fig. 1b), it can be 
seen that homodimerization of the catalytic domains creates a shal- 
low surface cleft separating the two active sites. A cluster of conserved 
acidic amino-acid residues in each catalytic centre coordinates a single 
magnesium ion. On substrate binding, the enzyme rearranges such that 
the dsRBDs clamp the dsRNA substrate over the surface of the catalytic 
domain dimer. 


Dicer 

The endonuclease Dicer processes dsRNA substrates (long dsRNAs and 
pre-miRNAs) into short dsRNA fragments (siRNAs and miRNAs) of 
defined length, typically ~21-25 nucleotides’. In addition to two copies 
of the conserved RNaselII domain and a dsRBD in the carboxyl terminus, 
Dicer enzymes usually have an amino-terminal DEXD/H-box domain, 
followed by a small domain of unknown function (the DUF283 domain) 
and a PAZ domain (Fig. 1a). The PAZ domain, which is also found in 
Argonaute proteins, binds specifically to the 3’ end of single-stranded 
RNA (ssRNA)”””. 

The crystal structure of Dicer from the unicellular eukaryote Giardia 
intestinalis revealed that the ability of Dicer enzymes to produce dsRNA 
fragments of specific length originates from a unique spatial arrangement 
of the PAZ domain and the RNaselII domains”. G. intestinalis Dicer isa 
naturally ‘trimmed-dowr version of the enzyme, consisting only of the 
PAZ domain and the tandem RNaselII domains (RNaselIIa and RNasellIIb) 
(Fig. 1a). Its structure resembles an axe, with the two RNaselIl catalytic 
domains forming the blade and the PAZ domain making up the base of 


a RNaselll dsRBD 
Bacterial RNaselll Pro-rich Arg+Ser-rich Es 
| RNasellla RNaselllb 
Human Drosha Saas 
DEXD/H box DUF283 PAZ Connector helix 
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Figure 1 | Structure of RNaselll-family enzymes. a, A schematic 
representation of the domain structure of RNaselII-family enzymes is shown. 
b, The induced fit during dsRNA binding to prokaryotic RNaselIII enzymes, 
which are homodimeric class I RNaselII enzymes, is shown. The crystal 
structure of RNasellII from the bacterium Thermotoga maritima in the RNA- 
free state (Protein Data Bank (PDB) identity 1O0W) is shown (left). RNaselII 
from the bacterium Aquifex aeolicus is shown bound to a cleaved dsRNA 
product (PDB identity 2EZ6) (right). The colours of the protein domains 
match those in panel a; the RNA is shown in gold. Magnesium ions, which 


the handle (Fig. 1c). The RNaselIIIa domain and the PAZ domain are 
connected by a long helix running the length of the handle. This connec- 
tor helix is buttressed by a platform domain formed by the N-terminal 
segment of the protein. The RNaselII domains form an intramolecular 
dimer that closely resembles the homodimeric structure of prokaryotic 
RNaselll'® (Fig. 1b). In Dicer, four conserved acidic amino-acid residues 
in the active site of each RNaseIIJ domain coordinate two metal cations, 
suggesting that Dicer uses a two-metal-ion mechanism to catalyse RNA 
cleavage. The 17.5 A distance between the two metal-ion pairs in the two 
active sites matches the width of the major groove in a dsRNA duplex. 
Modelling the binding ofa dsRNA substrate to the enzyme reveals that 
the duplex runs along a flat surface formed by the platform domain and 
makes electrostatic interactions with a number of positively charged 
residues. Mutation of the sequence encoding these residues impairs the 
catalytic activity of Dicer, underscoring the importance of these positively 
charged residues for substrate binding”. 

The PAZ domain of Dicer has the same fold and 3’-overhang-binding 
residues as the PAZ domain of Argonaute proteins (discussed in the next 
section)’. The distance between the 3’-overhang-binding pocket of the 
PAZ domain and the active site of the RNaselIIa domain is 65 A, which 
corresponds to the length of a 25-nucleotide RNA duplex (Fig. 1c). 
This is in good agreement with the size of siRNA fragments produced 
by G. intestinalis Dicer in vitro. The domain architecture of Dicer thus 
suggests that it functions as a molecular ruler, generating products of 
defined length by anchoring the 3’ dinucleotide of the substrate RNA 
duplex (generated by an initial nonspecific cleavage) in the PAZ domain 
and cleaving at a fixed distance from that end. This is supported by the 
observation that a truncated G. intestinalis Dicer protein that lacks the 
PAZ domain yields RNA products of variable length in vitro’. The struc- 
ture of G. intestinalis Dicer suggests that the length of the non-conserved 
connector helix is the main determinant of product size, providing a 
possible explanation for the variation in product size across species. 

The structure of a fragment of mouse Dicer comprising solely the 
RNaselIIb domain and dsRBD was recently solved and indicates that on 
substrate binding, the dsRBD undergoes a conformational change analo- 
gous to that observed in prokaryotic RNaselIII enzymes™. In addition, 
a truncated human Dicer protein lacking the dsRBD has a significantly 
decreased rate of RNA substrate cleavage in vitro, but its substrate affin- 
ity remains unaffected”. Most Dicer enzymes contain a DEXD/H-box 


Platform 
2 (DUF283) 


G. intestinalis Dicer-dsRNA model 


G. intestinalis Dicer 


are present at the active sites, are shown as purple spheres. Cleavage sites are 
indicated by arrows. From these structures, it is clear that the dsRNA-binding 
domains (dsRBDs; blue) undergo a marked rotation on substrate binding. 

c, The molecular mechanism of dsRNA cleavage by Giardia intestinalis Dicer 
is shown. The crystal structure of Dicer (PDB identity 2FFL) is shown (left), 
together with a model of a dsRNA substrate bound to Dicer (right). In this 
model, docking of the 3’ overhang of the RNA substrate in the PAZ domain 
leads to cleavage 65 A from the 3’ end. Images were generated from files from 
the PDB using PyMol (http://www.pymol.org). 
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Figure 2 | Modular architecture of Argonaute proteins. a, Eukaryotic 
Argonaute proteins have four domains: the N-terminal, PAZ, MID and 
PIWI domains. In some cases, notably for the structures of Archaeoglobus 
fulgidus Piwi protein, the MID domain and PIWI domain have been 
referred to as domain A and domain B, respectively. b, A crystal structure 
of Pyrococcus furiosus Argonaute (Ago) soaked with Mn™* (PDB 

identity 1Z25) is shown. The protein adopts a bilobate architecture, with 
the N-terminal and PAZ domains forming one lobe and the MID and PIWI 
domains forming the other. The metal ion in the active site is shown as a 
purple sphere. The amino-acid residues involved in metal-ion binding in 
the slicer catalytic site are shown in stick format. 


domain. These domains are found in a diverse group of proteins that 
are involved in the ATP-dependent binding and remodelling of nucleic 
acids”®. Although some invertebrate Dicer proteins (such as Drosophila 
melanogaster DCR-2) seem to require ATP for processive dicing of long 
dsRNAs (that is, multiple rounds of cleavage without dissociation from 
the dsRNA substrate), mammalian Dicer proteins seem to be ATP inde- 
pendent’***””**, Kinetic analysis of wild-type and mutant human Dicer 
proteins showed that the DEXD/H-box domain might have an auto- 
inhibitory function, because removal of this domain increases the cleav- 
age rate”. This finding suggests that the DEXD/H-box domain imposes 
on the protein a non-productive conformation that must be rearranged 
before catalysis can occur. Further structural and biochemical studies of 
full-length Dicer proteins and their substrate complexes will be neces- 
sary to establish whether the DEXD/H-box domain participates in the 
binding and unwinding of RNA duplexes during the loading of small 
RNAs into the RISC. 


Drosha and the microprocessor complex 

The RNaselII-family member Drosha catalyses the initial processing 
of pri-miRNAs, yielding pre-miRNAs, which are hairpins with phos- 
phorylated 5’ ends and 3’ dinucleotide overhangs’. Drosha is a nuclear 
protein, and its domain structure consists of a proline-rich region and 
an arginine- and serine-rich region at the N terminus, followed by 
two RNasellI domains and a dsRBD (Fig. 1a). Purified Drosha cleaves 
dsRNA nonspecifically; specific cleavage of pri-miRNAs requires asso- 
ciation with a protein known as DGCR8 (also known as PASHA in 
invertebrates) in a complex called the microprocessor”*’. DGCR8 binds 
to the base of the pri-miRNA hairpin, positioning Drosha to cleave 
the pri-miRNA stem at a distance of 11 base pairs from the junction 
between the duplex stem and the flanking ssRNA regions”. Thus, 
DGCR8 seems to be a trans-acting specificity determinant, analogous 
to the PAZ domain of Dicer, which acts in cis. The molecular architec- 
ture of Drosha is unknown, but the structure ofa core region of human 
DGCR&8, composed of a tandem pair of dsRBDs, was determined 
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recently”. The canonical RNA-binding surfaces of the two dsRBDs are 
non-contiguous, suggesting that the protein binds to two discontinuous 
segments of the pri-miRNA stem-loop structure. 


Structural insights into Argonaute function 

The common feature of RNAi and all related small-RNA-mediated 
silencing pathways is the association of a small silencing RNA (also 
known as a guide RNA in this context) with a protein of the Argonaute 
family*. The resultant protein-RNA complex forms the minimal core 
of the effector complex known as the RISC. Within the RISC, the small 
RNA functions as a sequence-specific guide that recruits an Argonaute 
protein to complementary target transcripts through base-pairing 
interactions. The target transcripts, typically mRNAs, are then either 
cleaved or prevented from being translated by ribosomes, leading to 
their degradation. 

Throughout evolution, the Argonaute family has diverged into 
specialized clades (or subfamilies) that recognize different small RNA 
types and confer the specific effects of the various small-RNA silen- 
cing pathways’’. Both siRNAs and miRNAs associate with members 
of the AGO clade of Argonaute proteins, whereas piRNAs bind to those of 
the PIWI clade. In classic RNAi, which is elicited by siRNAs, Argonaute 
proteins silence targeted mRNAs by catalysing their endonucleolytic 
cleavage, a process known as slicing. The PIWI clade of the Argonaute 
protein family is thought to use slicing in pik NA-mediated silencing 
of mobile genetic elements in the germ line**”*. During slicing, the tar- 
get RNA is cleaved at the scissile phosphate group, which is opposite 
the phosphate group between the tenth and eleventh nucleotides of the 
guide RNA strand, as measured from the 5’ end of the guide strand’. 
Slicing requires perfect complementarity between the guide strand and 
the target around the cleavage site*”**. Argonaute proteins can also 
silence transcripts independently of slicer activity. In the animal miRNA 
pathway, Argonaute proteins repress target mRNAs by inhibiting their 
translation and inducing deadenylation and subsequent mRNA decay. 
However, the precise mechanism of miRNA-mediated silencing is not 
fully understood”. 

To function as an effector of small-RNA-mediated silencing, the 
Argonaute protein must bind to the guide RNA strand, eject the non- 
guide (passenger) strand of the siRNA or miRNA-miRNA* duplex 
(where miRNA” is the passenger strand) during loading, and sub- 
sequently recognize the target RNA (Box 1). In silencing pathways that 
rely on RNA slicing, the Argonaute protein carries out multiple cycles 
of target binding, cleavage and product release, while the guide strand 
remains bound to the protein. In the metazoan miRNA pathway, in 
which silencing is achieved in a slicer-independent manner, the Argo- 
naute protein must remain tightly associated with the targeted mRNA 
to keep its translation repressed. 


Functional domains 

Argonaute proteins are multidomain proteins that contain an N-terminal 
domain, and PAZ, middle (MID) and PIWI domains (Fig. 2a). Crystal 
structures of prokaryotic Argonaute proteins have revealed a bilobate 
architecture, with the MID and PIWI domains forming one lobe, and 
the N-terminal and PAZ domains constituting the other (Fig. 2b). The 
two signature domains of the Argonaute family — the PAZ domain and 
the C-terminal PIWI domain — were originally identified by phylo- 
genetic sequence analysis“’. Three-dimensional structures of isolated 
PAZ domains from D. melanogaster Argonaute proteins revealed a fold 
similar to those of the oligosaccharide/oligonucleotide-binding (OB)- 
fold domain and Sm-fold domain’””. The first crystal structure of a full- 
length Argonaute protein, from the archaeal species Pyrococcus furiosus, 
showed that the sequence motif originally defined as the PIWI domain 
by Lorenzo Cerutti et al.” consists of two structural domains, termed 
MID and PIWI, joined by way of an extensive conserved interface that 
is centred on the buried C terminus of the protein“ (Fig. 2b). The MID 
domain resembles the sugar-binding domain of the lac repressor. The 
PIWI domain adopts a fold similar to that in RNaseH, an endoribo- 
nuclease that cleaves RNA-DNA hybrids”. Crystal structures of a 
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PIWI-like protein (called Piwi) from the archaeal species Archaeoglobus 
fulgidus (consisting of domain A and domain B, which are equivalent 
to the MID domain and the PIWI domain, respectively) and of a full- 
length Argonaute protein from the bacterium A. aeolicus also showed the 
presence of an RNaseH-like fold in their PIWI domains**“*. Biochemical 
studies suggest that, similarly to RNaseH, prokaryotic Argonaute pro- 


teins function as DNA-guided ribonucleases**, unlike their eukaryotic 


counterparts, which have evolved to use RNA molecules as guides. 


Slicer activity 

The finding that the PIWI domain of an Argonaute protein adopts 
an RNaseH fold suggested immediately that Argonaute proteins are 
responsible for the ‘slicer’ activity of the RISC”. Similarly to the require- 
ments for RNaseH activity, RISC-catalysed RNA cleavage requires 
divalent metal ions and yields a 5’ product, which has a free 3’ hydroxyl 
group, and a 3’ product, which carries a 5’ phosphate group** *. The 
active site of RNaseH contains an Asp-Asp-Glu/Asp motif, which coor- 
dinates the two divalent cations required for catalysis”. Similarly, in the 
crystal structure of P furiosus Argonaute (Ago) soaked with manga- 
nese ions, a conserved Asp-Asp-His motif was observed to coordinate 
a single manganese ion” (Fig. 2b). Mutagenesis of this motif in human 
AGO2 confirmed its requirement for slicer activity in vivo and in vitro, 
thus establishing the Argonaute protein as the catalytic component of 
RISCs**”. Similar analysis of Schizosaccharomyces pombe (fission yeast) 
Agol, acomponent of the RITS, showed that its slicer activity is required 
for transcriptional silencing”. Of the four human Argonaute proteins 
(AGO1, AGO2, AGO3 and AGO4), only AGO2 has demonstrable slicer 
activity**'. The motifs in AGO1 and AGO4 do not precisely conform 
to the consensus sequence, but AGO3 has a complete Asp-Asp-His 
sequence and yet is inactive in vitro. Moreover, it has recently been 
shown that D. melanogaster PIWI (one of the three members of the 
PIWI clade of Argonaute proteins in D. melanogaster), which has an 
Asp-Asp-Lys motif, is catalytically active”, in accordance with the 
postulated requirement for slicing in the piRNA pathway™*”’. D. mela- 
nogaster AGO1 and AGO2 have complete Asp-Asp-His motifs, and 
both are capable of slicing. But AGO1 is a much less efficient enzyme 
than AGO2 because it releases products at a slower rate, resulting in a 
slower turnover”. These findings indicate that factors in addition to 
the conservation of catalytic residues might determine whether a given 
Argonaute protein is an efficient slicer in vivo. 


Guide strand 


Human AGO1 PAZ-RNA complex 


Figure 3| Recognition of the termini of small RNAs by the PAZ and MID 
domains of Argonaute proteins. a, The crystal structure of the PAZ domain 
of human AGO1 (ribbon format) is shown in complex with an siRNA- 

like duplex (stick format) (PDB identity 1S13). Conserved residues in 
contact with the 3’ nucleotide are shown in stick format and labelled. The 
3’ end of the siRNA inserts into a preformed hydrophobic pocket, with the 
base of the 3’ nucleotide stacking against an invariant aromatic residue 
(the phenylalanine at position 292 in the protein, Phe 292). b, The crystal 
structure of Archaeoglobus fulgidus Piwi bound to an RNA duplex 

(PDB identity 2BGG), which mimics the guide-target interaction, is 


Recognition of RNA termini 

The incorporation of siRNAs and miRNAs into the RISC requires the 
presence of 5’ phosphate groups and 3’ dinucleotide overhangs at the ter- 
mini”. The discovery that the PAZ domain is an RNA-binding domain 
that specifically recognizes the 3’ ends of ssRNAs suggested immedi- 
ately that it might function as a module for anchoring the 3’ end of the 
guide RNA strand within the RISC'”"”. Insights into the mechanism of 
RNA recognition came from the crystal structure of the PAZ domain 
of human AGO1 in complex with an siRNA-like duplex” and from 
nuclear magnetic resonance structures of the PAZ domain of D. mel- 
anogaster AGO2 in complex with ssRNA oligonucleotides”. In these 
structures, the 3’ dinucleotide inserts into a preformed hydrophobic 
pocket that is lined with conserved aromatic residues (Fig. 3a). Although 
there are no sequence-specific contacts, the base of the terminal nucleo- 
tide stacks against the aromatic ring of a conserved phenylalanine 
residue (Fig. 3a). 

The 5’ phosphate groups of siRNAs and miRNAs, which result from 
their mechanism of biogenesis, are crucial for the efficient assembly 
of these small RNAs into the RISC’*’***“*. Moreover, the 5’ phosphate 
group is essential for slicing fidelity, because the position of the cleav- 
age site in the target RNA strand is determined by its distance from the 
5’ phosphate group of the guide RNA strand’*”™*. Crystal structures of 
the A. fulgidus Argonaute protein Piwi bound to a short dsRNA duplex 
(which mimics the interaction between the guide strand and the target 
strand within the RISC) showed that the 5’ nucleotide of the guide strand 
is distorted and does not base-pair with the target strand of the RNA 
duplex*** (Fig. 3b). This is consistent with the observation that a base 
mismatch at this position is tolerated and can increase slicer activity”. 
The 5’ phosphate group of the guide RNA strand is buried in a deep 
pocket at the interface between the MID domain and the PIWI domain 
and is bound to a magnesium ion that is, in turn, coordinated to the pro- 
tein’s C terminus (Fig. 3c). Mutation of any of the four residues involved 
in metal-ion coordination and 5’-phosphate binding, which are among 
the most highly conserved residues in the Argonaute protein family, 
impairs slicing activity*. This underscores the functional importance 
of the 5’-phosphate-binding pocket in anchoring the guide RNA. 

In the plant Arabidopsis thaliana, distinct types of small non-coding 
RNA associate with different Argonaute proteins on the basis of the 
identity of the 5’ nucleotide”. A. thaliana AGO1 binds mainly to 
RNAs with a uridine at their 5’ end, whereas AGO? recruits RNAs with 
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J Guide strand 


PIWI (Domain B) 


C terminus 


A. fulgidus Piwi-RNA complex 


shown. A. fulgidus Piwi is composed of a MID domain and a PIWI 

domain (also referred to as domain A and domain B, respectively). The 

5’ nucleotide of the guide strand (backbone shown in red) binds at the 
MID-PIWI domain interface and does not base-pair with the target strand 
(blue). c, Shown is a detailed view of the pocket in A. fulgidus Piwi that 
binds to the 5’ end of small RNAs. A magnesium ion, coordinated by the 

C terminus of the protein (Leu 427) and the 5’ phosphate group, is shown 
as a purple sphere. Conserved residues that are involved in metal-ion 
coordination and 5'-phosphate binding are shown in stick format 

and labelled. 
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a5’ adenosine. Domain-swap experiments involving chimaeric proteins 
showed that the specificity of the 5’ nucleotide is conferred by the MID 
and PIWI domains of the Argonaute proteins”. Similarly, the PIWI 
clade of the Argonaute protein family associates with piRNAs, which 
are ~24-31-nucleotide RNA species that are 2’-O-methylated and have 
a bias for uridine as the first nucleotide”. It is thus probable that the 
PAZ and MID domains of Argonaute proteins that bind to different 
small RNA species underwent an evolutionary adaptation that allows 
small RNAs to be sorted on the basis of particular 3’ and 5’ ends. 


RISC loading 

In the siRNA- and miRNA-mediated pathways of gene silencing, the 
loading of a guide RNA onto an Argonaute protein is closely linked with 
its biogenesis (Box 1). Starting from an siRNA duplex or a miRNA- 
miRNA” generated by Dicer-mediated cleavage, the strand with its 
5’ end at the less thermodynamically stable end of the duplex is selected 
as the guide strand®’. The observation, made from the structure of 
A. fulgidus Piwi bound to an siRNA-like duplex, that the first nucleotide 
of the guide strand is unpaired and buried in the 5’-phosphate-binding 
pocket suggests that this mode of guide-strand binding contributes 
to the apparent guide-strand selectivity’. Argonaute proteins also 
promote guide-strand selection by subsequently slicing the passenger 
strand, thereby facilitating its release” ©. However, for duplexes with 
multiple mismatches, as is often the case in the miRNA pathway, sli- 
cing is not required during loading”. Thus, Argonaute proteins lacking 
slicer activity (such as human AGO1, AGO3 and AGO4) can still be 
loaded with miRNAs. 

RISC loading takes place in the context of the RISC-loading complex. 
In human cells, the RISC-loading complex consists of an Argonaute 
protein, Dicer and the dsRBD-containing protein TRPB® ©. In vitro, this 
ternary complex can process a pre-miRNA, load the correct guide strand 
and cleave a target RNA, all in the absence of ATP”. What are the roles of 
dsRBD-containing proteins in substrate selection and RISC loading? In 
D. melanogaster, DCR-1 associates with the dsRBD-containing protein 
Loquacious (LOQS) to convert pre-miRNAs into miRNA-miRNA* 
duplexes®”'. By contrast, a complex of DCR-2 with its dsRBD-con- 
taining protein partner, R2D2, is required for the efficient production of 
siRNAs from long dsRNA precursors and for their subsequent loading 
onto AGO2 (refs 63, 72-74). Recent studies indicate that the structure 
and the extent of base-pairing (that is, the presence of mismatches) 
within siRNA and miRNA precursor duplexes determine whether the 
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T. thermophilus Ago-guide DNA 


Figure 4 | Mechanism of guide- and target-strand recognition by 
Argonaute proteins. a, The crystal structure of Thermus thermophilus 
Ago (ribbon format) bound to a 5’-phosphorylated 21-nucleotide guide 
DNA strand (stick format) (PDB identity 3DLH) is shown. Nucleotides 
1-11 and 18-21 of the guide DNA are visible in the structure and shown 
in stick format. The 5’ phosphate group of the guide strand binds to the 
MID domain, whereas the 3’ end is anchored in the PAZ domain. b, The 
crystal structure of a ternary complex of T. thermophilus Ago witha 
5’-phosphorylated 21-nucleotide guide DNA strand and a 20-nucleotide 
target RNA (PDB identity 3F73) is shown. The complex is shown in the 
same orientation as the binary complex in panel a, after superposition of 
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guide strands associate with AGO1 or AGO? in D. melanogaster**”*. The 
affinity of DCR-2-R2D2 is higher for perfectly matched duplexes than 
for ‘bulged’ duplexes (such as pre-miRNA hairpins), and this seems to 
be the main source of small RNA selectivity during AGO2 loading”. 


Target recognition 

In the A. fulgidus Piwi-RNA complex structure, the guide-target duplex 
is positioned over a conserved basic channel that spans the surfaces of 
both the MID domain and the PIWI domain**® (Fig. 3c). Importantly, 
these structures show that the bases of nucleotides 2-6 of the guide 
strand (starting from the 5’ end) are exposed and free to base-pair with 
a target mRNA. This agrees well with numerous computational and 
biochemical studies indicating that nucleotides 2-8 of the guide strand 
(known as the ‘seed’ region) are crucial determinants of the specificity 
of target recognition by miRNAs’”’”*. Modelling the trajectory ofa full- 
length siRNA-target duplex places the scissile phosphate group of the 
target RNA in the putative slicer catalytic site, thus providing a rationale 
for the specificity of slicer cleavage at a fixed distance from the 5’ end of 
the guide strand''””. Perfect complementarity around the cleavage site 
in the guide-target duplex is a prerequisite for slicing****. Presumably, 
base-pairing around the 10-11-base step ensures correct orientation of 
the scissile phosphate group in the active site. 

In the context of the bilobate architecture of full-length Argonaute pro- 
teins, the guide-target duplex has been proposed to bind ina positively 
charged cleft between the PAZ-N-terminal and MID-PIWI lobes". 
Recent crystal structures of full-length Thermus thermophilus Argo- 
naute protein bound to guide DNA strands have identified the molecular 
details of guide-strand recognition”. The structure of T: thermophilus 
Ago in complex with a 5’-phosphorylated 21-nucleotide DNA shows 
the guide strand bound with its 5’ phosphate group in the MID domain 
and with its 3’ end in the PAZ domain” (Fig. 4a). Through interactions 
with conserved arginine residues, nucleotides 2-10 of the guide strand 
adopta stacked helical conformation. Thus, the seed region of the guide 
strand is pre-organized to initiate base-pairing with the target strand 
(Fig. 4a). In the absence of a target strand, the guide strand is kinked at 
the 10-11-base step, indicating that a further structural rearrangement 
occurs on target-RNA recognition. 

The most recent insights into the mechanism of target-RNA rec- 
ognition come from the crystal structure of a ternary complex con- 
sisting of T. thermophilus Ago, a 21-nucleotide guide DNA strand 
and a 20-nucleotide target RNA with mismatched bases introduced 
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the PIWI domains. The target RNA (blue) and guide DNA (grey backbone) 
are shown in stick format. The arrow indicates the conformational change 
undergone by the lobe containing the PAZ and N-terminal domains on 
target-RNA binding. c, Putative models for target-RNA recognition by 
Argonaute proteins are illustrated. The fixed-end model postulates that 
both the 5’ and 3’ ends of the guide strand remain docked in their binding 
pockets during slicing. The two-state model postulates that the seed 
region of the guide occurs in two steps: first, nucleation of base-pairing 
with the target RNA; and second, propagation of the guide-target 

duplex, leading to the release of the 3’ end of the guide strand from 

the PAZ domain. 
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at the 10-11-base step* (Fig. 4b). Both termini of the guide strand 
are anchored in their respective binding pockets, as is the case for the 
T. thermophilus Ago-guide DNA complex. The seed region of the guide 
strand (nucleotides 2-8) engages in Watson—Crick base-pairing inter- 
actions with the target RNA, assuming an A-form helical conform- 
ation. To accommodate the target RNA strand in the central channel, 
Ago undergoes a pronounced conformational change to a more open 
conformation, achieved by rotating the N-terminal and PAZ domains 
away from the lobe containing the MID and PIWI domains. 

Two models for target recognition and cleavage by Argonaute proteins 
have been proposed*"*’. The fixed-end model postulates that both ends 
of the guide RNA remain bound to the Argonaute protein during slicing. 
This presents a topological constraint on the guide-target interaction 
that would limit the extent of base-pairing to less than one helical turn 
(11 base pairs) and might be the factor that limits the seed region to 
nucleotides 2-8 of the guide RNA. By contrast, the two-state model 
proposes that the target binds to the seed region of the guide strand, and 
then propagation of base-pairing towards the 3’ end of the guide strand 
results in this end of the guide strand dissociating from the PAZ domain. 
At present, it is unclear whether the structure of the T: thermophilus Ago 
ternary complex represents a cleavage-competent complex as envisaged 
by the fixed-end model or whether it depicts a trapped intermediate as 
postulated by the two-state model. Further structural studies of catalyti- 
cally inactive Argonaute proteins in complexes with perfectly matched 
guide-target duplexes will be necessary to clarify this issue. Nonetheless, 
the mode of target-RNA recognition observed in the T: thermophilus 
Ago ternary complex might be representative of target mRNA recogni- 
tion during miRNA-mediated silencing in metazoans, in which slicing 
is typically prevented by a base mismatch between the miRNA and the 
target at around the 10-11-base step. 


Slicer-independent functions 
In metazoans, gene silencing by miRNAs occurs by the mere anchor- 
ing of amiRNA-containing RISC to the target mRNA, in the absence 
of slicer-dependent cleavage of the target mRNA®. The mechanism of 
miRNA-mediated repression is unclear, but it must involve interac- 
tions between the RISC and the cellular machineries that are respon- 
sible for translation and mRNA decay. Recent studies using in vitro 
systems indicate that miRNAs might affect the initiation of translation 
by interfering with the function of the EIF4F, a complex that binds to 
the mRNA cap (7-methylguanosine)**. The MID domain of human 
AGO2 shows limited sequence homology to the cap-binding motif of 
the translation initiation factor EIF4E, a subunit of EIF4E, suggesting 
that human AGO2 competes with EIF4E for binding to the mRNA 
cap structure®’. However, the MID domain of Argonaute proteins lacks 
any discernible structural homology to EIF4E. Thus, the AGO2-cap 
interaction, if indeed direct, might be a non-canonical mode of mRNA 
cap recognition. 

Argonaute proteins, miRNAs and their mRNA targets co-localize in 
P bodies, which are cytoplasmic foci where MRNA decay is thought 
to occur. In mammalian cells and in D. melanogaster, the interaction 
between Argonaute proteins and the P-body protein GW 182 is required 
for miRNA-mediated repression of mRNAs, as well as for Argonaute 
localization**™. It was shown recently that the glycine- and tryptophan- 
rich (GW) motifs of GW 182 directly interact with the MID-PIWI 
region of human Argonaute proteins and that a minimal fragment 
(termed the Ago hook), encompassing two tandem GW motifs, is suf- 
ficient to mediate this interaction”. Interestingly, mutations that disrupt 
the Argonaute-GW 182 interaction have been mapped mainly to the 
5'-phosphate-binding pocket located at the interface between the MID 
domain and PIWI domain®”. 


Future prospects 

Structural studies and structure-based biochemical studies will con- 
tinue to improve our understanding of the mechanisms and evolution- 
ary relationships of RNAi pathways. An important future goal will be to 
determine the structures of some of the proteins and protein complexes 


that operate in eukaryotic cells. Elucidating the molecular architecture 
of the RISC-loading complex will bring important insights into the 
coupling of small RNA biogenesis, Argonaute loading and guide-strand 
selection. Recent proteomic studies have uncovered a multitude of Arg- 
onaute-interacting proteins (for example GW182, MOV 10 and FXR1) 


that might link Argonaute proteins to downstream effector events”’””. 


Obtaining structural and structure-based functional insights into these 
interactions will help to uncover the molecular mechanisms that under- 
lie the functions of Argonaute proteins in small-RNA-mediated gene 
silencing. Results from these studies will not only provide new mecha- 
nistic details but also lay the foundation for the informed engineering 
of RNAias a therapeutic tool. a 
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Small RNAs in transcriptional gene 
silencing and genome defence 


Danesh Moazed' 


Small RNA molecules of about 20-30 nucleotides have emerged as powerful regulators of gene expression 
and genome stability. Studies in fission yeast and multicellular organisms suggest that effector complexes, 
directed by small RNAs, target nascent chromatin-bound non-coding RNAs and recruit chromatin- 
modifying complexes. Interactions between small RNAs and nascent non-coding transcripts thus reveal 
anew mechanism for targeting chromatin-modifying complexes to specific chromosome regions and 
suggest possibilities for how the resultant chromatin states may be inherited during the process of 


chromosome duplication. 


RNA interference (RNAi) originally referred to the ability of exogenously 
introduced double-stranded RNA (dsRNA) molecules to silence the 
expression of homologous sequences in the nematode Caenorhabditis 
elegans’. It has become clear over the past decade that RNAi is mecha- 
nistically related to a number of other conserved RNA silencing path- 
ways, which are involved in the cellular control of gene expression and 
in protection of the genome against mobile repetitive DNA sequences, 
retroelements and transposons’ *. These RNA silencing pathways are 
all associated with small (~20-30 nucleotide) RNAs that function as 
specificity factors for inactivating homologous sequences by a variety 
of mechanisms. At least three classes of small RNA have been iden- 
tified so far (Table 1). The first two classes, short interfering RNAs 
(siRNAs) and microRNAs (miRNAs), are ~21-25 nucleotides and are 
generated from longer dsRNA precursors by Dicer, a ribonuclease III 
(RNasellI) enzyme. They are loaded into the RNA-induced silencing 
complex (RISC) or a nuclear form of RISC, called the RNA-induced 
transcriptional silencing complex (RITS)*"®. RISC and RITS are effector 
complexes that are targeted to homologous sequences by base-pairing 
interactions involving the guide strand of the small RNA. The core 
component of each complex is a highly conserved PAZ- and PIWI- 
domain-containing protein called Argonaute, which binds to the guide 
small RNA by means of interactions that involve its PAZ domain, as well 
as the PIWI and middle (MID) domains, and cleaves the target RNA 
by means of its RNaseH-like PIWI domain (see page 405 for further 
information about the structural biology of RNAi proteins). 

The Argonaute family of proteins, together with the small RNAs that 
program them, are the central players in RNA silencing, and seem to 
participate in all small-RNA silencing pathways thus far described. 
Phylogenetically, Argonaute-family proteins are divided into the AGO 
and PIWI clades"’. The PIWI-clade proteins bind to a third class of small 
RNAs, called PIW1-interacting RNAs (piRNAs), which have a broader 
average size (~24—31 nucleotides) than siRNAs and miRNAs and are 
involved in defence against parasitic DNA elements’*”*. As discussed 
later, piRNA-programmed PIWI-clade proteins are also likely to func- 
tion as RISC- and RITS-like complexes that target the inactivation of 
homologous sequences (‘Table 1). With the notable exception of bud- 
ding yeast, small-RNA-mediated silencing mechanisms and their role in 
chromatin regulation are conserved throughout eukaryotes, indicating 
an ancient evolutionary origin. 


This Review discusses the roles of diverse small-RNA silencing 
pathways in the regulation of chromatin structure and transcription 
in plants, animals and fungi, with particular emphasis on emerging 
common themes. In addition to their well-known roles in post-tran- 
scriptional gene silencing (PTGS), in which silencing is directed at the 
level of messenger RNA translation or stability, nearly all small-RNA 
silencing pathways also seem to act at the DNA and chromatin level 
(Table 1). Studies in Schizosaccharomyces pombe (fission yeast) and other 
organisms suggest that small RNAs access DNA through interactions 
with nascent RNA transcripts, revealing a close relationship between 
nuclear and cytoplasmic RNA silencing mechanisms. Moreover, small- 
RNA silencing pathways seem to be intimately integrated with the RNA 
surveillance and processing pathways that determine the ultimate fate of 
RNA transcripts. Together, these studies reveal a broad and previously 
unsuspected role for RNAi and other RNA-processing mechanisms in 
the regulation of the structure and expression of eukaryotic genomes. 
Here, I discuss small-RNA silencing pathways and their role in chro- 
matin regulation, drawing parallels between well-established examples 
in S. pombe and other organisms. 


RNA silencing pathways 

RNA silencing pathways can be broadly classified into different 
branches based on their mechanism of action, subcellular location and 
the origin of the small RNA molecules that they use (Table 1). How- 
ever, the different branches have common components and intersect 
in some instances. siRNAs act in both the nucleus and the cytoplasm 
and are involved in PTGS and chromatin-dependent gene silencing 
(CDGS). CDGS refers to both transcriptional gene silencing (TGS) 
and co-transcriptional gene silencing (CTGS)*. miRNAs are gener- 
ated from hairpin precursors by the successive actions of the RNasellI 
enzymes Drosha and Dicer, which are located in the nucleus and cyto- 
plasm, respectively (see page 396 for a more detailed discussion of small 
RNA precursor processing and complex assembly). Although Drosha 
is absent in plants, the general features of the miRNA pathway are 
conserved in plants and animals, but not in fungi and other protozoa. 
Whereas the vast majority of miRNAs seem to act exclusively in the 
cytoplasm and mediate mRNA degradation or translational arrest”, 
some plant miRNAs may act directly in promoting DNA methylation”. 
Furthermore, recent studies describe a role for promoter-directed 
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Table 1| Conservation of small-RNA silencing pathways in eukaryotes 


Small RNA Size (nucleotides) Mechanism of action Eukaryotes 
conserved in 

siRNA ~21-25 PTGS (RNA degradation Plants, animals, 
or translational arrest) fungi, ciliates 
CDGS 

miRNA ~21-25 PTGS (RNA degradation Plants, animals 
or translational arrest) 
CDGS (toa lesser extent) 

piRNA ~24-31* PTGS (RNA degradation) Animals 
CDGS (to a lesser extent) 

All three of the major RNA silencing pathways identified thus far seem to act in both post- 


transcriptional gene silencing (PTGS) and chromatin-dependent gene silencing (CDGS) pathways. 
CDGS refers to chromatin-dependent silencing events that involve the assembly of small RNA 
complexes on nascent transcripts and includes both transcriptional gene silencing (TGS) and 
co-transcriptional gene silencing (CTGS) events. The latter involves the chromatin-dependent 
processing or degradation of the nascent transcript. *Caenorhabditis elegans piRNAs are 

21 nucleotides. 


human miRNAs in facilitating repressive chromatin modifications and 
TGS”, siRNAs are generated from long dsRNA precursors, which can 
be produced from a variety of single-stranded RNA (ssRNA) precur- 
sors. These precursors include sense and antisense RNAs transcribed 
from convergent promoters, which can anneal to form dsRNA, and 
hairpin RNAs that result from transcription through inverted repeat 
regions” *° (Fig. 1a). In some situations the long dsRNA is produced 
enzymatically from certain aberrant or non-coding RNA precursors. 
One example of this pathway involves aberrant RNAs that lack process- 
ing signals or are produced by Argonaute slicer activity. These RNAs 
recruit RNA-dependent RNA polymerase (RdRP) enzymes, which rec- 
ognize free 3’ ends and synthesize dsRNA””*” (Fig. 1b, c). Here RARP 
enzymes are in competition with the TRAMP polyadenylation pathway, 
which targets aberrant RNAs for degradation by a 3'->5’ exonuclease 
complex, called the exosome**'(Fig. 1b). The siRNA branch of the 
pathway seems to be conserved from fungi to mammals (Table 1), 
although Drosophila melanogaster (fruitflies) and mammals lack RdRPs 
and cannot amplify siRNAs. 

piRNAs originate from a diversity of sequences, including repetitive 
DNA and transposons, and like siRNAs they seem to act at both the 
post-transcriptional and chromatin levels’ *. The mechanism(s) that 
generates and amplifies piRNAs is not yet fully elucidated but involves the 
slicer activity of the PIWI-clade proteins themselves* (Fig. 1d). This class 
of small RNAs is present in D. melanogaster, C. elegans and mammals, 
but seems to be absent in fungi and plants (Table 1). 


Small RNAs in DNA and chromatin regulation 

An accumulating body of evidence supports an important role for small 
RNAs in the modulation of chromatin structure and TGS in plants, 
fungi and animal cells. RNA silencing was first linked to TGS by the 
discovery that transgene and viral RNAs guide the methylation of 
homologous DNA sequences in plants*’. Analysis of the guide RNAs 
in Arabidopsis thaliana revealed that these RNAs were processed into 
small RNAs of ~25 nucleotides, similar to the size previously described 
for miRNAs™”. This observation and the realization that exogenously 
introduced dsRNA in animals is processed into siRNAs* established 
small RNAs as central players in diverse RNA silencing pathways. Later 
studies in A. thaliana indicated that RNA-directed DNA methylation 
of the FWA transgene requires Dicer (DCL3) and Argonaute (AGO4), 
and is linked to histone H3 lysine 9 (H3K9) methylation, indicat- 
ing that RNA-directed DNA methylation and RNAi have common 
molecular mediators***. 

Evidence for the role of RNA silencing in mediating changes at the 
chromatin level also came from studies of silent or heterochromatic 
DNA domains in unicellular eukaryotes, such as S. pombe and the cili- 
ate Tetrahymena thermophila. S. pombe contains single genes encoding 
the Argonaute, Dicer and RdRP proteins, called ago1, dcr1 and rdp1, 
respectively. Deletion of any of these genes results in loss of hetero- 
chromatic gene silencing, markedly reduced H3K9 methylation at 
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centromeric repeats, and accumulation of non-coding RNAs, which 
are transcribed from centromeric repeat regions and processed into 
siRNAs”’*. Moreover, RNAi is directly linked to a structural compo- 
nent of heterochromatin through RITS, which in S. pombe contains 
Agol, the chromodomain protein Chp1, the glycine and tryptophan 


(GW)-motif-containing protein Tas3 and centromeric siRNAs'”””””. 


T. thermophila cells are binucleate with a germline micronucleus and 
a somatic macronucleus. Development of a new macronucleus after 
sexual conjugation and meiosis involves massive DNA elimination 
of non-genic sequences. This elimination requires TWII, a T. ther- 
mophila PIW1-clade protein, and PDD1, a chromodomain protein that 
binds to both K9- and K27-methylated histone H3 (refs 40-42). In 
addition, DNA elimination is associated with Dicer-produced small 
RNAs, called scan RNAs (scnRNAs), giving rise to the idea that a scn- 
RNA RITS-like complex targets sequences destined for elimination 
into heterochromatin”’. However, a physical association between 
chromatin proteins and TWI1 has not yet been reported. 

RNAiis also linked to chromatin modifiers in animal cells. In D. mel- 
anogaster, the introduction of multiple tandem copies of a transgene 
results in silencing of both the transgene array and the endogenous 
copies. This repeat-induced gene silencing, which is analogous to RNA- 
mediated co-suppression in plants’, requires components of the Poly- 
comb group (PcG) of genes, as well as several RNAi factors, including 
PIWI and AGO? (ref. 43). The PcG gene products are chromatin-binding 
and -modifying repressors that prevent the expression of homeobox 
(HOX) regulators outside their proper domains of expression™*. The 
requirement for both PcG proteins and PIW1 in transgene silencing 
suggested the possibility that in D. melanogaster, as in plant cells, RNA 
silencing could operate at the chromatin level. In fact, later studies 
showed that RNA silencing factors are also required for the formation 
of D. melanogaster centric heterochromatin, recruitment of hetero- 
chromatin protein 1 (HP1) and silencing of transgenes that are inserted 
in pericentromeric heterochromatin®””. In addition to HP1 and PIWI, 
efficient silencing requires DCR-1, PIWI, Aubergine (AUB) and the 
putative helicase HLS (also known as SPN-E)**. Moreover, silencing of 
a mini-white gene, which is mediated by a cis-acting repeated element 
from the heterochromatic Y chromosome, requires HP1, SU(VAR)3-9 
(the H3K9 methyltransferase), as well as PIWI, AUB, HLS and DCR-1 
(ref. 46). Transgene-induced gene silencing in C. elegans has also been 
shown to require RNAi and chromatin modifiers”. Surprisingly, 
screens for defects in classical RNAi, mediated by feeding of dsRNA, 
have also uncovered several chromatin modifiers, suggesting that per- 
haps the connection between RNAi and chromatin modifiers may not 
be limited to repeat-induced silencing”. 

In contrast to their apparent requirement for PcG-mediated repeat- 
induced gene silencing, RNAi components do not seem to be required 
for PcG-mediated silencing of HOX genes outside their proper domains 
of expression”. Mutations in several RNA silencing factors disrupt the 
silencing of a tandem mini-white gene array and perturb the nuclear 
clustering of PcG-repressed HOX loci”. However, despite their require- 
ment for PcG-mediated repeat-induced silencing, loss of PIWI and 
RNAi components does not lead to a loss of HOX gene silencing. The 
simplest explanation for these observations is that RNAi is required for 
some, but not all, PcG-mediated silencing events. 


Linking heterochromatin to RNAi 

Heterochromatin is associated with repetitive DNA sequences and 
transposons, and has important roles in chromosome transmission, 
maintenance of genomic stability, and regulation of gene expression”. 
With the exception of budding yeast, which lacks centromeric DNA 
repeats, heterochromatin is concentrated at repeats and transposons 
that surround centromeres, telomeres and other genomic loci (Fig. 2a). 
Two important defining properties of heterochromatin involve its 
modes of assembly and inheritance. First, heterochromatin assembly 
involves nucleation sites, which act as entry points for the recruitment 
and spreading of repressor proteins. Unlike recruitment, which involves 
the action of a site-specific DNA-binding protein or RNA molecule, 
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spreading occurs in a sequence-independent manner and involves 
changes in chromatin structure that are mediated by histone-modifying 
enzymes. The second defining property of heterochromatin is its mode 
of inheritance. Once assembled, heterochromatin is inherited through 
many cell divisions, at least partly independently of the underlying DNA 
sequence. The mechanisms of spreading and epigenetic inheritance 
of heterochromatin are poorly understood, but, in S. pombe, require 
components of the RNAi pathway*?***. 

At the molecular level, heterochromatin is characterized by association 
with hypoacetylated histones and, in organisms ranging from S. pombe to 
humans, by association with H3K9 dimethylation and trimethylation*”. 
H3K9 is methylated by SU(VAR)3-9 in D. melanogaster, SUV39H in 
humans and Clr4 in S. pombe, and creates a binding site for HP1 (Swi6é 
and Chp?2 in S. pombe)”. HP1 proteins contain a chromodomain that 
binds to methylated H3K9 and a chromoshadow (CSD) domain, which 
is involved in other protein-protein interactions™. 

Biochemical isolation of S. pombe heterochromatin and RNAi com- 
plexes has provided direct physical links between heterochromatin 
and RNAi proteins, leading to models for how RNAi mediates hetero- 
chromatin assembly and participates in gene silencing. In addition to 
HP!1 proteins, heterochromatic gene silencing in S. pombe requires the 
chromodomain protein Chp1 (ref. 59). Chp1 is larger than HP1 and, 
like the Polycomb (Pc) subfamily of chromodomains, contains only 
a single chromodomain at its amino terminus. Like Swié and Chp2, 
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Figure 1 | Pathways of RNA processing and biogenesis of small RNAs. 

a, Generation of endogenous siRNAs from dsRNA resulting from 
convergent transcription (sense-antisense RNA base-pairing; top) or 
transcription through inverted repeat sequences (hairpin RNA formation; 
bottom). TER, transcription termination signal. b, Processing of non- 
coding and aberrant RNAs by the RDRC and TRAMP complexes, 
containing the Cid12 and Cid14 non-canonical polyadenylation 
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Chp1 is a structural component of heterochromatin and is required for 
heterochromatic gene silencing”. Unlike Swié and Chp2, which are not 


required for H3K9 methylation within centromeric repeat regions”, 


a lack of Chp1 in cells leads to a marked loss of H3K9 methylation, 
indicating that Chp1 has a critical role in heterochromatin formation. 

Biochemical purification of Chp1 showed that it is associated with 
Agol in RITS"®. RITS acts as a specificity determinant for the recruit- 
ment of other RNAi complexes and chromatin-modifying enzymes 
to specific DNA regions. RITS also physically associates with and is 
required for recruitment of the RNA-directed RNA polymerase complex 
(RDRC) to non-coding RNAs that are transcribed from centromeric 
repeats®’, RDRC contains the S. pombe RNA-directed RNA polym- 
erase, Rdp1, a putative helicase termed Hrr1, and Cid12, a member of 
the Trf4 and Trf5 family of polyadenylation polymerases”, which were 
first identified in the budding yeast Saccharomyces cerevisiae and are 
involved in the degradation of aberrant transcripts*””*’. The physical 
association of RITS and RDRC is siRNA- and Clr4-dependent, suggest- 
ing that this association occurs on chromatin and requires histone H3K9 
methylation”. These observations further suggest that RITS and RDRC 
localize to chromatin-bound nascent RNA bya mechanism that involves 
tethering the nascent transcript to chromatin via the bivalent complex 
RITS (Fig. 2). In addition to RITS, S. pombe contains a second Agol- 
containing complex, named Argonaute siRNA chaperone (ARC) com- 
plex®’. The Agol protein in the ARC complex contains duplex, rather 
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polymerases, respectively; the RDRC/Dicer pathway produces duplex 
siRNAs, whereas the TRAMP/exosome pathway produces single-stranded 
degradation products. c, Generation of a free 3’ end by the slicer activity 
of an Argonaute protein, which can be processed into dsRNA by RdRP or 
targeted for degradation by the exosome (not shown). d, Pathway for the 
generation of piRNAs by the PIWI clade of Argonaute proteins: PIWI, 
AUB and AGO3. 
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Figure 2 | Chromosome organization and the 
nascent transcript model for heterochromatic 
gene-silencing assembly in Schizosaccharomyces 


Telomere pombe. a, The structure of S. pombe centromeric 
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than single-stranded, siRNA, indicating that the slicer activity of Agol 
(refs 63, 64), which is required for the release of the siRNA passenger 
strand, is inhibited in this complex” 2 


Nascent transcripts as assembly platforms 

In principle, siRNAs in RITS can base-pair with either unwound DNA 
regions or with nascent non-coding RNAs that are transcribed from 
their target DNA. The two models are not mutually exclusive and base- 
pairing with DNA and RNA may contribute to different aspects of the 
mechanism of siRNA biogenesis and function. However, although a 
role for siRNA-DNA base-pairing cannot be ruled out at this point, 
several lines of evidence support siRNA-RNA base-pairing inter- 
actions in which the siRNA targets nascent non-coding transcripts 
(Fig. 2). First, RITS associates with the RDRC, which uses ssRNA as a 
template to synthesize dsRNA, providing evidence that RITS itself is 
RNA-associated®'. Furthermore, the RITS-RDRC interaction requires 
siRNA and the Clr4 H3K9 methyltransferase, suggesting that it occurs 
on heterochromatin-bound transcripts”. Together with the observ- 
ation that proteins required for heterochromatin formation — such as 
Sir2, Swi6, Clr4 and other components of the Clr4 methyltransferase 
complex (CLRC), as well as RITS and RDRC — are required for siRNA 
accumulation’***-®, these studies suggest that siRNA-programmed 
RITS localizes to nascent chromatin-tethered non-coding transcripts 
and recruits the RDRC to initiate dsRNA synthesis and siRNA amplifi- 
cation (Fig. 2b). Direct support for this model comes from experiments 
in which the Tas3 component of RITS was fused to the phage A N 
(AN) protein and tethered to the transcript of a euchromatic ura4* 
gene, which was modified with the addition of five AN-binding sites 
upstream of its transcription termination sequences (ura4-5BoxB)”. 
In cells containing ura4-5BoxB, the Tas3-AN protein could efficiently 
initiate de novo siRNA generation and heterochromatin formation™. 
Like the situation at centromeres, siRNA generation in this system is 
H3K9 methylation-dependent, suggesting that Tas3-AN associates with 
chromatin-bound nascent transcripts and initiates RNAi-mediated het- 
erochromatin assembly. Finally, several splicing factors associate with 
RDRC*® and are required for RNAi-mediated centromeric gene silen- 
cing. These results provide additional support for co-transcriptional 
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is methylated on lysine 4 (green lollipops) in 
euchromatic regions. b, The nascent transcript 
model for heterochromatin assembly. The RITS 
is tethered to chromatin through base-paring 
interactions between siRNAs and nascent 
non-coding transcripts and interactions with 
H3K9-methylated nucleosomes, resulting in the 
recruitment of RDRC-Dicer, dsRNA synthesis 
and siRNA amplification. This RNAi positive- 
feedback loop then recruits the CLRC H3K9 
methyltransferase. Efficient silencing also 
requires two HP1 proteins (Swi6 and Chp2), 
which promote the association of RITS with 

the non-coding RNA or mediate TGS through 
recruitment of the SHREC2 deacetylase complex, 
respectively. Another tier of regulation, involving 
the degradation of heterochromatic transcripts by 
the TRAMP/exosome pathway, further ensures 
full gene silencing. Blue arrows (bottom) highlight 
convergent transcription resulting in synthesis of 
sense and antisense RNAs, which may contribute 
to the production of trigger siRNAs. 
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processing of non-coding centromeric RNAs, as spliceosomal compo- 
nents are known to associate with nascent RNA transcripts co-tran- 
scriptionally. A role for the nascent transcript in acting as a template for 
the recruitment of chromatin-modifying activities may be conserved 
throughout eukaryotes. For example, large non-coding RNAs such as 
XIST, which is involved in X-chromosome inactivation, are thought to 
be involved in the recruitment of histone and DNA methyltransferase 
enzymes”. However, the mechanism of recruitment of chromatin- 
modifying activities to XIST may involve site-specific RNA-binding 
proteins rather than small RNAs. 


Chromatin-dependent processing of siRNAs 

A remarkable observation in studies of RNAi in S. pombe is that the 
generation of centromeric siRNAs is a heterochromatin-dependent 
event®’. In the nascent transcript model (Fig. 2b), RITS associates 
with methylated H3K9 through the chromodomain of its Chp1 com- 
ponent and captures the nascent non-coding transcript through base- 
pairing interactions involving siRNAs bound to its Agol protein. In 
cells lacking the H3K9 methyltransferase Clr4 or any component of 
the CLRC, the levels of centromeric siRNAs are greatly diminished”. 
Moreover, one of the two HP1 proteins, Swi6, is required for efficient 
siRNA generation®’” and the association of RDRC with centromeric 
DNA repeats” and non-coding centromeric RNAs”. Furthermore, 
the crucial chromatin-dependent step in siRNA generation involves 
dsRNA synthesis by RDRC, as the introduction of a long dsRNA- 
containing hairpin into S. pombe cells circumvents the requirement 
for both RDRC and Clr4 in siRNA generation”. These results suggest 
that RDRC is only able to synthesize dsRNA on chromatin-bound 
templates after it has been recruited by RITS, revealing the existence of 
a chromatin-dependent step in the activation of the dsRNA biogenesis 
and siRNA amplification pathway in S. pombe. The resultant dsRNA 
is processed into siRNA by Dcrl, which is also physically tethered to 
RDRC”. Heterochromatin regulation of small RNA production may be 
conserved in metazoans. X-TAS (transposable P elements inserted in 
telomeric-associated sequences on the X chromosome) and flamenco, 
two major piRNA-producing loci that control the transposition of 
Pand gypsy elements in D. melanogaster, respectively, are embedded 
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in heterochromatin, and their genome defence function requires both 
PIWI and HP1 (refs 73, 74). 


siRNA-mediated initiation of chromatin silencing 

An important question regarding the role of RNA in gene silencing 
is whether small RNAs can initiate de novo chromatin modifications. 
Although small RNAs are important components of some CDGS 
mechanisms, their ability to initiate chromatin modifications seems 
to be under strict control by other mechanisms. In S. pombe, ectopic- 
ally produced hairpin siRNAs can initiate H3K9 methylation and gene 
silencing at only a subset of target loci”. siRNA-mediated CDGS cor- 
relates with chromosomal location and the occurrence of antisense 
transcription at the targeted locus, and requires overexpression of 
the Swié (HP1) protein”. This may be reflecting the importance of 
cooperativity in the recruitment of RITS and other Argonaute or PIWI 
effector complexes to chromatin. In addition to siRNAs, stable asso- 
ciation of RITS with chromatin requires the binding of the chromo- 
domain in Chp1 to H3K9-methylated nucleosomes'™”. In the absence 
of H3K9 methylation, the initial binding of RITS to chromatin may 
be inefficient. Swi6 overproduction may help in initial RITS binding 
by stabilizing low levels of H3K9 methylation that occur throughout 
the genome, or alternatively by tethering the nascent transcript at the 
target locus to chromatin’! (Fig. 2b). Similar limitations may explain 
the context-dependent ability of siRNAs to promote DNA methylation 
in plants”, as well as the observed variability in siRNA-mediated chro- 
matin modifications in animal cells (for example, see refs 76, 77). The 
ability of siRNAs to act as initiators is reminiscent of the role of DNA- 
binding transcription factors in the regulation of transcription, which 
often involves cooperativity between two or more transcription factors 
and is sensitive to local chromatin structure. 


Small RNAs and epigenetic inheritance 

Mechanisms that mediate the cis-inheritance of chromatin states and 
their associated gene-expression patterns remain enigmatic. It has long 
been known that during DNA replication, old parental histones are 
randomly distributed onto the two newly synthesized daughter DNA 
strands”. This retention of old histones during DNA replication has 
given rise to the idea that histone modifications mediate the epigenetic 
inheritance of chromatin states. Histone modifications, such as H3K9 
methylation, create binding sites for proteins such as Chp1, Chp2 and 
Swi6, as well as the methyltransferase Clr4 (ref. 79) (Fig. 2b). Their 
retention during DNA replication could in principle serve as a mark 
for the re-recruitment of new chromatin-modifying activities that re- 
establish old modification patterns. However, the affinity of modi- 
fied histones for specific binding proteins may be too low to allow the 
specific re-establishment of chromatin states, and other inputs into the 
mechanism are required”. The nascent transcript model, described 
above, provides a possible mechanism for epigenetic inheritance of 
heterochromatin. As in plants and other systems that contain an RdRP- 
dependent siRNA amplification mechanism”, the siRNA generation 
mechanism in S. pombe is likely to form a positive-feedback loop". 
Two specific features of this loop may underlie the mechanism that 
ensures the epigenetic inheritance of histone H3K9 methylation and 
heterochromatin. First, siRNAs can recruit H3K9 methylation to 
chromatin, possibly through physical interactions with the CLRC 
or dsRNA”’*!, Thus, so long as siRNAs corresponding to a specific 
chromatin domain are present, they can recruit H3K9 methylation 
to that domain (Fig. 2b). The second feature involves a requirement 
for H3K9 methylation and chromatin localization in activating the 
siRNA positive-feedback loop”. This ensures that siRNAs are cis- 
restricted and is central to the role of siRNAs as epigenetic maint- 
enance factors: siRNAs act only on those daughter DNA strands that 
have inherited old parental histone H3 molecules containing H3K9 
methylation. Such cooperativity-based mechanisms involving the dual 
recognition of histone marks and other specificity factors (siRNAs 
or DNA-binding proteins) are likely to underlie all epigenetic 
cis-inheritance mechanisms. 


RNAi and exosome-mediated RNA degradation 

It may seem paradoxical that RNAi, which requires transcription, is 
required for assembling heterochromatin, a state that is associated with 
gene inactivation and TGS**!. However, multiple mechanisms seem 
to ensure that transcription in heterochromatin does not result in the 
production of mature transcripts, thereby keeping heterochromatic 
genes off, despite transcription. First, heterochromatic transcripts 
are degraded or processed into siRNAs by the RNAi machinery itself 
through a process that has been referred to as CTGS or cis-PTGS** 
(Fig. 2b). CTGS requires the tethering of the RNAi machinery to 
heterochromatin by H3K9 methylation. This mechanism makes 
a major contribution to the silencing of some promoters in centro- 
meric DNA repeats, although TGS is also an important contributing 
mechanism®”""’. Second, an RNAi-independent RNA surveillance 
mechanism involving the TRAMP polyadenylation complex, which 
contains Cid14 (a Trf4/5 homologue), Airl, and Mtr1 in S. pombe, also 
targets heterochromatic transcripts for degradation”. In S. cerevisiae, 
TRAMP recognizes aberrant transcripts that lack polyadenylation sig- 
nals and targets them for degradation by the exosome, a 3’>5’ exo- 
nuclease complex***'**. The presence of another member of the Trf4 
polyadenylation polymerase family, Cid12, in the RDRC™ suggests 
that RDRC and TRAMP may compete for access to heterochromatic 
transcripts (Fig. 1b). Furthermore, TRAMP and RDRC may compete 
more broadly for RNA substrates, because in cid14 deletion cells new 
classes of RNAs become RNAi targets and are processed into siRNAs”. 
The involvement of members of the Trf4 family in RNAi processes in 
C. elegans and T. thermophila suggests a conserved role for members of 
this family in the coordination of exosome-mediated RNA surveillance 
with RNAi***>. Finally, transcription in heterochromatin is cell-cycle 
regulated and is largely restricted to the S phase of the cell cycle”. 
This transcription is associated with high levels of siRNAs during the S 
phase, which may be important for epigenetic re-establishment of his- 
tone H3K9 methylation by the RITS-RDRC-CLRC complexes. How- 
ever, it remains to be determined whether the increase in centromeric 
transcription and siRNA levels in S phase is merely a reflection of cell- 
cycle-associated changes in chromatin structure or has an important 
role in RNAi-mediated heterochromatin assembly. 

Nearly all co-transcriptional RNA-processing events studied so far, 
including pre-mRNA capping, splicing and 3’-end processing, involve 
association between components of the processing machinery and RNA 
polymerase II (Pol II). Association with the polymerase is thought to 
help ensure that processing occurs in an orderly fashion and couples 
mRNA maturation with mRNA export. In addition, these associations 
serve to couple RNA quality control with transcription, ensuring that 
only true mRNAs are exported from the nucleus for translation. There 
is evidence that RNAi-mediated co-transcriptional heterochromatin 
assembly also involves interactions with Pol II°*”**. Point mutations in 
two different Pol II subunits in S. pombe, Rpb2 and Rpb7, have been 
isolated in screens for defects in centromeric heterochromatin assem- 
bly. Neither mutation is associated with a growth defect or general per- 
turbation of transcription”, suggesting that the mutations may affect 
specific interactions with components of the RNAi machinery or the 
CLRC. Such interactions may contribute to efficient siRNA generation 
or H3K9 methylation by stabilizing the association of RITS-RDRC 
with nascent transcripts. Interestingly, in A. thaliana, RNA-dependent 
DNA methylation involves interactions between an Argonaute protein 
and RNA Pol IV, a plant-specific DNA-dependent RNA polymerase” 
(discussed below). 


Conservation of small-RNA-mediated silencing 

As discussed above, RNA silencing mechanisms have critical roles 
in endogenous chromatin-mediated processes in plants, C. elegans, 
D. melanogaster, ciliates and fungi. The role of small RNAs in chro- 
matin silencing can also be extended to mammalian cells, although 
the mechanisms and physiological pathways are not yet clear. Reports 
from several laboratories provide evidence for the occurrence of DNA 
and histone modifications, which are promoted by the introduction 
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of siRNAs or hairpin RNAs into mammalian cell lines””’”””*”'. In these 
studies, siRNAs are directed to the promoter regions of target genes 
and induce the recruitment of repressive histone marks such as H3K9 
and H3K27 methylation”, but silencing is not always associated with 
CpG methylation”*. In addition to chromatin-modifying complexes, 
siRNA-mediated TGS in mammalian cells requires AGO1 and AGO2 
when the gene encoding progesterone is targeted”, and AGO1 when 
the gene encoding the human immunodeficiency virus 1 co-receptor 
(CCR5) is targeted”’. The recent identification of endogenous siRNAs in 
D. melanogaster and mammalian cells, which map to intergenic regions 
and are produced from dsRNA resulting from antisense transcription 
or long-hairpin structures, raises the intriguing possibility that some 
of these siRNAs modulate chromatin structure”. 

The PIWI-clade proteins and their associated piRNAs have impor- 
tant roles in the control of transposons in the germline — and possibly 
somatic cells — of D. melanogaster and mammals‘. The mouse MIWI2 
member of this family is required for silencing the long interspersed 
nuclear element 1 (LINE-1) and intracisternal A particle (IAP) transpos- 
able elements in the testis, and in Miwi2 mutants both LINE-1 and IAP 
DNA is demethylated”, suggesting that piRNAs, directly or indirectly, 
mediates changes in DNA methylation. It remains unclear how the role 
of PIWI proteins in transposon silencing in the germline may be related 
to their function in repeat-induced and heterochromatic gene silencing 
in somatic cells described in D. melanogaster***. 

Although the mechanisms that link RNA to chromatin and the bio- 
chemical nature of the relevant complexes have not been defined yet, the 
available evidence allows us to draw some parallels between the nascent 
transcript model in S. pombe and other systems. The common denomi- 
nator in the RNA silencing pathways operating in genome regulation 
is the linkage of Argonaute or PIWI proteins to chromatin- or DNA- 
associated molecules (Fig. 3). Argonaute proteins associate with adaptor 
proteins containing the conserved GW motif, which binds to their PIWI 
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H3K9me in transposons 
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Figure 3 | Argonaute complexes that link RNA silencing to chromatin 
modifications. Argonaute proteins in different silencing pathways, 
including miRNA- and siRNA-mediated PTGS, are associated with 
conserved GW-motif-containing adaptor proteins, which help direct 

them to different targets. a, In many organisms, GW182 (a GW-motif- 
containing protein) or one of its homologues associates with the AGO1 
and AGO? proteins and directs them to P bodies. b, In S. pombe, Ago1 in 
the RITS is linked to heterochromatin through its association with the GW 
protein Tas3, which also binds to Chp1. Chp1 in turn associates with H3K9 
methylated nucleosomes (H3K9me) through its chromodomain (CD). 
Swi6 (a homologue of HP1) acts as an accessory factor that helps tether the 
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domain and is required for miRNA-mediated silencing” (Fig. 3a). In 
S. pombe, the GW-motif-containing protein Tas3 links Agol to Chp1 
(refs 10, 94, 95). The binding of Chp1 to a methylated nucleosome then 
serves to tether nascent non-coding RNA, which is base-paired with 
siRNA in Agol, to chromatin (Fig. 3b). This tethering seems to be cru- 
cial in that it links RNAi to chromatin and ‘activates’ the Agol-bound 
nascent transcript complex to mediate chromatin modifications”. 
A similar Argonaute tethering situation seems to exist in A. thaliana, 
where, in addition to AGO4 and DCL3, RNA-directed DNA methyl- 
ation requires the plant-specific Pol IV”. Pol IV exists as Pol TVA and 
Pol IVB complexes, which differ in their largest component, NRPD1A 
and NRPD1B, respectively. Pol IVB and AGO4 are thought to act down- 
stream of Pol IVA and DCL3, which are required for siRNA genera- 
tion, to trigger DNA methylation. NRPD1B contains a GW motif at its 
carboxyl terminus”. This GW-motif-containing domain links Pol IVB 
to AGO4, providing a parallel with the function of other GW-domain- 
containing proteins, such as the Tas3 component of RITS in S. pombe*”* 
(Fig. 3e). Thus, in plants, the strategy for coupling RNAi to chromatin 
involves a physical interaction between a repeat- or heterochromatin- 
specific RNA polymerase and an Argonaute protein. Once an siRNA- 
programmed AGO4 localizes to a nascent transcript synthesized by 
Pol IVB, it may trigger histone H3K9 and DNA methylation by recruit- 
ing the appropriate methyltransferase enzymes (Fig. 3e). 

The role of the D. melanogaster PIWI protein in repeat-induced gene 
silencing and heterochromatin assembly seems to involve a direct associ- 
ation between PIWI and HPI (ref. 100) (Fig. 3c). PIWI-HP1 may func- 
tion as a RITS that targets nascent transcripts in repeat DNA elements 
and tethers these transcripts to chromatin by means of base-pairing 
interaction between piRNAs in PIWI and the association of PIWI with 
HP1 (Fig. 3c). Unlike the case with RITS and AGO4, this tethering does 
not seem to involve a GW-domain-containing protein and is mediated 
by the HP1 CSD anda conserved CSD-binding PXVXL motif (where X 


H3K9me and DNA methylation 
in repeats (A. thaliana) 


non-coding RNA to heterochromatin. The chromoshadow domain (CSD) 
is involved in protein-protein interactions. cenDNA, centromeric repeat 
DNA. ¢, In D. melanogaster, PIW1 is targeted to heterochromatin through 
direct interactions with HP1; the association of PIWI with HP1 is mediated 
through the PXVXL motif, present in many HP1-binding proteins, rather 
than through a GW motif. d, In D. melanogaster and possibly other 
organisms, AGO1 and AGO2 have been implicated in mediating chromatin 
modifications, but the putative chromatin adaptor (CAD) protein has 

not been identified. e, In A. thaliana, AGO4 is linked to Pol IVB, which 
contains a GW motif at its carboxyl terminus and is specifically required 
for DNA methylation and silencing of heterochromatic repeats. 
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is any amino acid) present in D. melanogaster PIWI'™. The PIWI-HP1 
complex may be required for the recruitment and spreading of H3K9 
methylation or possibly for the co-transcriptional degradation of RNAs 
that may escape heterochromatic TGS. It remains to be determined 
whether this PIWI-HP1 complex acts more broadly in piRNA-mediated 
silencing of transposons in the germline. Similarly, the possible role of 
AGO1 and AGO2 in siRNA-dependent gene silencing may be mediated 
by interactions with unidentified chromatin adaptors (Fig. 3d). 


Future prospects 

RNAi and related RNA silencing pathways have emerged as new 
mechanisms for the regulation of the structure and activity of genes 
and genomes. Our understanding of the mechanisms that allow 
some small RNAs to act at the DNA and chromatin level, and restrict 
other small RNAs to mRNA regulation in the cytoplasm, is still at an 
early stage. Although accumulating evidence suggests that nuclear small- 
RNA pathways are conserved, the endogenous pathways that may use 
small RNAs for genome regulation in animal cells remain for the most 
part unknown. Another gap in our knowledge of nuclear small-RNA 
pathways in animal cells involves the biochemical identification of the 
molecular networks that link different types of small RNA to chromatin 
proteins. Whereas Argonaute and PIWI proteins, as well as small RNAs, 
have been implicated in mediating chromatin or DNA modifications, it 
remains unclear how specific chromosome regions are targeted and how 
modifying enzymes are recruited. Future studies are likely to provide 
new and surprising insights about the way in which small and large 
non-coding RNAs regulate chromatin structure and how this ability is, 
in turn, regulated. | 


1. Fire, A. etal. Potent and specific genetic interference by double-stranded RNA in 
Caenorhabditis elegans. Nature 391, 806-811 (1998). 

2. Baulcombe, D. RNA silencing in plants. Nature 431, 356-363 (2004). 

3. Buhler, M. & Moazed, D. Transcription and RNAi in heterochromatic gene silencing. Nature 
Struct. Mol. Biol. 14, 1041-1048 (2007). 

4. Aravin, A.A., Hannon, G. J. & Brennecke, J. The Piwi-piRNA pathway provides an adaptive 
defense in the transposon arms race. Science 318, 761-764 (2007). 

5. Hamilton, A.J. & Baulcombe, D. C. A species of small antisense RNA in posttranscriptional 
gene silencing in plants. Science 286, 950-952 (1999). 

6. Zamore, P.D., Tuschl, T., Sharp, P. A. & Bartel, D. P. RNAi: double-stranded RNA directs 
the ATP-dependent cleavage of mRNA at 21 to 23 nucleotide intervals. Cel! 101, 25-33 
(2000). 

7. Bernstein, E., Caudy, A. A., Hammond, S. M. & Hannon, G. J. Role for a bidentate 
ribonuclease in the initiation step of RNA interference. Nature 409, 363-366 (2001). 

8. Elbashir, S. M., Lendeckel, W. & Tuschl, T. RNA interference is mediated by 21- and 
22-nucleotide RNAs. Genes Dev. 15, 188-200 (2001). 

9. Hammond, S. M., Bernstein, E., Beach, D. & Hannon, G. J. An RNA-directed nuclease 
mediates post-transcriptional gene silencing in Drosophila cells. Nature 404, 293-296 
(2000). 

This study describes the biochemical isolation of RISC. 

0. Verdel, A. et al. RNAi-mediated targeting of heterochromatin by the RITS complex. Science 
303, 672-676 (2004). 

This study describes the purification and component identification of RITS, which 
physically links RNAi to heterochromatin. 

Tl. Carmell, M.A., Xuan, Z., Zhang, M. Q. & Hannon, G. J. The Argonaute family: tentacles that 
reach into RNAi, developmental control, stem cell maintenance, and tumorigenesis. Genes 
Dev. 16, 2733-2742 (2002). 

2. Aravin, A. etal. A novel class of small RNAs bind to MILI protein in mouse testes. Nature 
442, 203-207 (2006). 

3. Girard, A., Sachidanandam, R., Hannon, G. J. & Carmell, M.A. A germline-specific class of 
small RNAs binds mammalian Piwi proteins. Nature 442, 199-202 (2006). 

14. Grivna, S. T., Beyret, E., Wang, Z. & Lin, H. A novel class of small RNAs in mouse 
spermatogenic cells. Genes Dev. 20, 1709-1714 (2006). 

5. Lau, N.C. etal. Characterization of the piRNA complex from rat testes. Science 313, 
363-367 (2006). 

6. Watanabe, T. et al. Identification and characterization of two novel classes of small RNAs 
in the mouse germline: retrotransposon-derived siRNAs in oocytes and germline small 
RNAs in testes. Genes Dev. 20, 1732-1743 (2006). 

7. Batista, P. J. et al. PRG-1 and 21U-RNAs interact to form the piRNA complex required for 
fertility in C. elegans. Mol. Cell 31, 67-78 (2008). 

8. Das, P.P. etal. Piwi and piRNAs act upstream of an endogenous siRNA pathway to 
suppress Tc3 transposon mobility in the Caenorhabditis elegans germline. Mol. Cell 31, 
79-90 (2008). 

9. Filipowicz, W., Jaskiewicz, L., Kolb, F. A. & Pillai, R. S. Post-transcriptional gene silencing by 
siRNAs and miRNAs. Curr. Opin. Struct. Biol. 15, 331-341 (2005). 

20. Bao,N., Lye, K. W. & Barton, M. K. MicroRNA binding sites in Arabidopsis class II| HD-ZIP 
mRNAs are required for methylation of the template chromosome. Dev. Cell 7, 653-662 
(2004). 

21. Gonzalez, S., Pisano, D. G. & Serrano, M. Mechanistic principles of chromatin remodeling 
guided by siRNAs and miRNAs. Cell Cycle 7, 2601-2608 (2008). 


22. 


23: 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35: 


36. 


37; 


38. 


39. 


40. 


Al. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


5; 


52. 


53: 


54. 
55. 


56. 


Kim, D.H., Saetrom, P., Snove, O. Jr & Rossi, J. J. MicroRNA-directed transcriptional gene 
silencing in mammalian cells. Proc. Natl Acad. Sci. USA 105, 16230-16235 (2008). 
Ghildiyal, M. et al. Endogenous siRNAs derived from transposons and mRNAs in Drosophila 
somatic cells. Science 320, 1077-1081 (2008). 

Chung, W.J., Okamura, K., Martin, R. & Lai, E. C. Endogenous RNA interference provides a 
somatic defense against Drosophila transposons. Curr. Biol. 18, 795-802 (2008). 

awamura, Y. et al. Drosophila endogenous small RNAs bind to Argonaute 2 in somatic 
cells. Nature 453, 793-797 (2008). 

Dalmay, T., Hamilton, A., Rudd, S., Angell, S. & Baulcombe, D. C. An RNA-dependent RNA 
polymerase gene in Arabidopsis is required for posttranscriptional gene silencing mediated 
by a transgene but not by a virus. Cell 101, 543-553 (2000). 

Sijen, T. et al. On the role of RNA amplification in dsRNA-triggered gene silencing. Cell 107, 
465-476 (2001). 
Buhler, M., Haas, W., Gygi, S. P. & Moazed, D. RNAi-dependent and -independent RNA 
urnover mechanisms contribute to heterochromatic gene silencing. Cel! 129, 707-721 
(2007). 
This study identified a role for co-transcriptional RNA processing, mediated by the 
TRAMP/exosome pathway, as an additional layer of regulation that is required for 
efficient heterochromatic gene silencing. 

Buhler, M., Spies, N., Bartel, D. P. & Moazed, D. TRAMP-mediated RNA surveillance 
prevents spurious entry of RNAs into the Schizosaccharomyces pombe siRNA pathway. 
Nature Struct. Mol. Biol. 15, 1015-1023 (2008). 

Wyers, F. et al. Cryptic Pol Il transcripts are degraded by a nuclear quality control pathway 
involving a new poly(A) polymerase. Cell 121, 725-737 (2005). 

LaCava, J. et al. RNA degradation by the exosome is promoted by a nuclear polyadenylation 
complex. Cell 121, 713-724 (2005). 

Wassenegger, M., Heimes, S., Riedel, L. & Sanger, H. L. RNA-directed de novo methylation 
of genomic sequences in plants. Cell 76, 567-576 (1994). 

Mette, M. F., Aufsatz, W., van der Winden, J., Matzke, M. A. & Matzke, A. J. Transcriptional 
silencing and promoter methylation triggered by double-stranded RNA. EMBO J. 19, 
5194-5201 (2000). 

These two studies described a role for RNA in promoting the methylation of homologous 
genomic sequences in plants. 

Zilberman, D., Cao, X. & Jacobsen, S. E. ARGONAUTE4 control of locus-specific siRNA 
accumulation and DNA and histone methylation. Science 299, 716-719 (2003). 
Henderson, |. R. et al. Dissecting Arabidopsis thaliana DICER function in small RNA 
processing, gene silencing and DNA methylation patterning. Nature Genet. 38, 721-725 
(2006). 

Lippman, Z., May, B., Yordan, C., Singer, T. & Martienssen, R. Distinct mechanisms 
determine transposon inheritance and methylation via small interfering RNA and histone 
modification. PLoS Biol. 1, e67 (2003). 

Volpe, T. A. et al. Regulation of heterochromatic silencing and histone H3 lysine-9 
methylation by RNAi. Science 297, 1833-1837 (2002). 

This study identified a role for components of the RNAi pathway in heterochromatin 
assembly and gene silencing at S. pombe centromeres. 

Reinhart, B. J. & Bartel, D. P. Small RNAs correspond to centromere heterochromatic 
repeats. Science 297, 1831 (2002). 

Cam, H. P. et al. Comprehensive analysis of heterochromatin- and RNAi-mediated 
epigenetic control of the fission yeast genome. Nature Genet. 37, 809-819 (2005). 
Mochizuki, K., Fine, N. A., Fujisawa, T. & Gorovsky, M. A. Analysis of a piwi-related gene 
implicates small RNAs in genome rearrangement in Tetrahymena. Cell 110, 689-699 
(2002). 
This study identified a role for T. thermophila TWI1, a PIWI-clade Argonaute protein, 
and small RNAs in mediating DNA elimination. 
Taverna, S. D., Coyne, R. S. & Allis, C. D. Methylation of histone H3 at lysine 9 targets 
programmed DNA elimination in Tetrahymena. Cell 10, 701-711 (2002). 

Liu, Y. et al. RNAi-dependent H3K27 methylation is required for heterochromatin 

ormation and DNA elimination in Tetrahymena. Genes Dev. 21, 1530-1545 (2007). 
Pal-Bhadra, M. et al. Heterochromatic silencing and HP1 localization in Drosophila are 
dependent on the RNAi machinery. Science 303, 669-672 (2004). 

Ringrose, L. & Paro, R. Epigenetic regulation of cellular memory by the Polycomb and 
Trithorax group proteins. Annu. Rev. Genet. 38, 413-443 (2004). 

Deshpande, G., Calhoun, G. & Schedl, P. Drosophila argonaute-2 is required early in 
embryogenesis for the assembly of centric/centromeric heterochromatin, nuclear division, 
nuclear migration, and germ-cell formation. Genes Dev. 19, 1680-1685 (2005). 

Haynes, K. A., Caudy, A. A., Collins, L. & Elgin, S. C. Element 1360 and RNAi components 
contribute to HP1-dependent silencing of a pericentric reporter. Curr. Biol. 16, 2222-2227 
(2006). 

Grishok, A., Sinskey, J. L. & Sharp, P. A. Transcriptional silencing of a transgene by RNAi in 
he soma of C. elegans. Genes Dev. 19, 683-696 (2005). 

Robert, V. J., Sijen, T., van Wolfswinkel, J. & Plasterk, R. H. Chromatin and RNAi factors 
protect the C. elegans germline against repetitive sequences. Genes Dev. 19, 782-787 
(2005). 
im, J. K. et al. Functional genomic analysis of RNA interference in C. elegans. Science 308, 
164-1167 (2005). 

Grimaud, C. et al. RNAi components are required for nuclear clustering of Polycomb group 
response elements. Cel! 124, 957-971 (2006). 

Grewal, S. |. & Elgin, S. C. Transcription and RNA interference in the formation of 
heterochromatin. Nature 447, 399-406 (2007). 

Folco, H. D., Pidoux, A. L., Urano, T. & Allshire, R. C. Heterochromatin and RNAi are required 
to establish CENP-A chromatin at centromeres. Science 319, 94-97 (2008). 

Zaratiegui, M., Irvine, D. V. & Martienssen, R. A. Noncoding RNAs and gene silencing. Cell 
128, 763-776 (2007). 

Grewal, S. |. & Jia, S. Heterochromatin revisited. Nature Rev. Genet. 8, 35-46 (2007). 

Rea, S. et al. Regulation of chromatin structure by site-specific histone H3 
methyltransferases. Nature 406, 593-599 (2000). 

Lachner, M., O'Carroll, D., Rea, S., Mechtler, K. & Jenuwein, T. Methylation of histone H3 
lysine 9 creates a binding site for HP1 proteins. Nature 410, 116-120 (2001). 


419 


© 2009 Macmillan Publishers Limited. All rights reserved 


57. 


58. 


59. 


60. 


61. 


62. 


420 


. Mello, C.C. & Conte, D. 


Bannister, A. J. et al. Selective recognition of methylated lysine 9 on histone H3 by the HP1 
chromo domain. Nature 410, 120-124 (2001). 

Nakayama, J., Rice, J.C., Strahl, B. D., Allis, C. D. & Grewal, S. |. Role of histone H3 lysine 

9 methylation in epigenetic control of heterochromatin assembly. Science 292, 110-113 
(2001). 

Partridge, J. F., Borgstrom, B. & Allshire, R. C. Distinct protein interaction domains and 
protein spreading in a complex centromere. Genes Dev. 14, 783-791 (2000). 

Sadaie, M.., lida, T., Urano, T. & Nakayama, J. A chromodomain protein, Chp1, is required for 
the establishment of heterochromatin in fission yeast. EMBO J. 23, 3825-3835 (2004). 
Motamedi, M. R. et al. Two RNAi complexes, RITS and RDRC, physically interact and 
localize to noncoding centromeric RNAs. Cell 119, 789-802 (2004). 

Sugiyama, T., Cam, H., Verdel, A., Moazed, D. & Grewal, S. |. RNA-dependent RNA 
polymerase is an essential component of a self-enforcing loop coupling heterochromatin 
assembly to siRNA production. Proc. Natl Acad. Sci. USA 102, 152-157 (2005). 

These two studies identified the S. pombe RdRP complex and demonstrated that it has 
invitro dsRNA synthesis activity and is associated with RITS and non-coding centromeric 
RNA. They further showed that the dsRNA synthesis activity is required for RNAi- 
mediated gene silencing. 

Buker, S. M. et al. Two different Argonaute complexes are required for siRNA generation and 
heterochromatin assembly in fission yeast. Nature Struct. Mol. Biol. 14, 200-207 (2007). 


. Irvine, D. V. et al. Argonaute slicing is required for heterochromatic silencing and spreading. 


Science 313, 1134-1137 (2006). 

Noma, K. et al. RITS acts in cis to promote RNA interference-mediated transcriptional and 
post-transcriptional silencing. Nature Genet. 36, 1174-1180 (2004). 

Buhler, M., Verdel, A. & Moazed, D. Tethering RITS to a nascent transcript initiates RNAi- 
and heterochromatin-dependent gene silencing. Cell 125, 873-886 (2006). 

This study demonstrated that tethering of the Tas3 component of RITS to the RNA 
transcript of a euchromatic reporter gene induced RNAi and heterochromatin assembly 
at the reporter gene. 

Hong, E.-J.E., Villen, J., Gerace, E. L., Gygi, S. P. & Moazed, D. A Cullin E3 ubiquitin ligase 
complex associates with Rik] and the Clr4 histone H3-K9 methyltransferase and is 
required for RNAi-mediated heterochromatin formation. RNA Biol. 2, 106-111 (2005). 
Bayne, E. H. et al. Splicing factors facilitate RNAi-directed silencing in fission yeast. Science 
322, 602-606 (2008). 

Wutz, A. RNAs templating chromatin structure for dosage compensation in animals. 
Bioessays 25, 434-442 (2003). 

lida, T., Nakayama, J. & Moazed, D. siRNA-mediated heterochromatin establishment 
requires HP1 and is associated with antisense transcription. Mol. Cell 31, 178-189 (2008). 
Motamedi, M. R. et al. HP1 proteins form distinct complexes and mediate heterochromatic 
gene silencing by non-overlapping mechanisms. Mol. Cell 32, 778-790 (2008). 
Colmenares, S. U., Buker, S. M., Buhler, M., Dlakic, M. & Moazed, D. Coupling of double- 
stranded RNA synthesis and siRNA generation in fission yeast RNAi. Mol. Cell 27, 449-461 
(2007). 

Sarot, E., Payen-Groschene, G., Bucheton, A. & Pelisson, A. Evidence for a piwi-dependent 
RNA silencing of the gypsy endogenous retrovirus by the Drosophila melanogaster flamenco 
gene. Genetics 166, 1313-1321 (2004). 

Reiss, D., Josse, T., Anxolabehere, D. & Ronsseray, S. aubergine mutations in Drosophila 
melanogaster impair P cytotype determination by telomeric P elements inserted in 
heterochromatin. Mol. Genet. Genomics 272, 336-343 (2004). 

Chan, S. W., Zhang, X., Bernatavichute, Y. V. & Jacobsen, S. E. Two-step recruitment of 
RNA-directed DNA methylation to tandem repeats. PLoS Biol. 4, e363 (2006). 

Ting, A. H., Schuebel, K. E., Herman, J. G. & Baylin, S. B. Short double-stranded RNA 
induces transcriptional gene silencing in human cancer cells in the absence of DNA 
methylation. Nature Genet. 37, 906-910 (2005). 
anowski, B. A. et al. Involvement of AGO1 and AGO2 in mammalian transcriptional 
silencing. Nature Struct. Mol. Biol. 13, 787-792 (2006). 

Sogo, J. M., Stahl, H., Koller, T. & Knippers, R. Structure of replicating simian virus 40 
minichromosomes. The replication fork, core histone segregation and terminal structures. 
J. Mol. Biol. 189, 189-204 (1986). 
Jenuwein, T. & Allis, C. D. Translating the histone code. Science 293, 1074-1080 (2001). 

r. Revealing the world of RNA interference. Nature 431, 338-342 


(2004). 


81. 


82. 


100. 


Zhang, K., Mosch, K., Fischle, W. & Grewal, S. |. Roles of the Clr4 methyltransferase 
complex in nucleation, spreading and maintenance of heterochromatin. Nature Struct. Mol. 
Biol. 15, 381-388 (2008). 

Chen, E. S. etal, Cell cycle control of centromeric repeat transcription and heterochromatin 
assembly. Nature 451, 734-737 (2008). 

Vanacova, S. et al. A new yeast poly(A) polymerase complex involved in RNA quality 
control. PLoS Biol. 3, e189 (2005). 

Chen, C. C. etal. A member of the polymerase B nucleotidyltransferase superfamily is 
required for RNA interference in C. elegans. Curr. Biol. 15, 378-383 (2005). 

Lee, S. R. & Collins, K. Physical and functional coupling of RNA-dependent RNA 
polymerase and Dicer in the biogenesis of endogenous siRNAs. Nature Struct. Mol. Biol. 14, 
604-610 (2007). 

Kloc, A., Zaratiegui, M., Nora, E. & Martienssen, R. RNA interference guides histone 
modification during the S phase of chromosomal replication. Curr. Biol. 18, 490-495 
(2008). 

Kato, H. et al. RNA polymerase II is required for RNAi-dependent heterochromatin 
assembly. Science 309, 467-469 (2005). 

Djupedial, |. et al. RNA Pol Il subunit Rpbb7 promotes centromeric transcription and RNAi- 
directed chromatin silencing. Genes Dev. 19, 2301-2306 (2005). 

El-Shami, M. et al. Reiterated WG/GW motifs form functionally and evolutionarily 
conserved ARGONAUTE-binding platforms in RNAi-related components. Genes Dev. 21, 
2539-2544 (2007). 

This study, together with references 93-95, establishes GW-motif-containing proteins as 
conserved Argonaute adaptors. 

Morris, K. V., Chan, S. W., Jacobsen, S. E. & Looney, D. J. Small interfering RNA-induced 
transcriptional gene silencing in human cells. Science 305, 1289-1292 (2004). 

Kim, D.H., Villeneuve, L.M., Morris, K. V. & Rossi, J. J. Argonaute-1 directs siRNA-mediated 
transcriptional gene silencing in human cells. Nature Struct. Mol. Biol. 13, 793-797 (2006). 
Carmell, M. A. et al. MIWI2 is essential for spermatogenesis and repression of transposons 
in the mouse male germline. Dev. Cell 12, 503-514 (2007). 

Eulalio, A., Huntzinger, E. & Izaurralde, E. GW182 interaction with Argonaute is essential 
for miRNA-mediated translational repression and mRNA decay. Nature Struct. Mol. Biol. 15, 
346-353 (2008). 
Debeauchamp, J. L. et al. Chp1-Tas3 interaction is required to recruit RITS to fission yeast 
centromeres and for maintenance of centromeric heterochromatin. Mol. Cell. Biol. 28, 
2154-2166 (2008). 
Till, S. etal. A conserved motif in Argonaute-interacting proteins mediates functional 
interactions through the Argonaute PIWI domain. Nature Struct. Mol. Biol. 14, 897-903 
(2007). 
Li, C. F. et al. An ARGONAUTE4-containing nuclear processing center colocalized with 
Cajal bodies in Arabidopsis thaliana. Cell 126, 93-106 (2006). 

Pontes, O. et al. The Arabidopsis chromatin-modifying nuclear siRNA pathway involves a 
nucleolar RNA processing center. Cell 126, 79-92 (2006). 

Kanno, T. et al. Atypical RNA polymerase subunits required for RNA-directed DNA 
methylation. Nature Genet. 37, 761-765 (2005). 

Herr, A. J., Jensen, M. B., Dalmay, T. & Baulcombe, D. C. RNA polymerase IV directs 
silencing of endogenous DNA. Science 308, 118-120 (2005). 

Brower-Toland, B. et al. Drosophila PIWI associates with chromatin and interacts directly 
with HPta. Genes Dev. 21, 2300-2311 (2007). 

This study identifies a physical association between HP1 and PIWI, suggesting that HP1 
may collaborate with PIWI in establishing repressive chromatin domains. 


Acknowledgements | thank the members of my laboratory and colleagues in 
the chromatin and RNA silencing fields for fruitful discussions, and the National 


Institutes of Health, the Leu 
Medical Institute for funding. 


emia and Lymphoma Society, and the Howard Hughes 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The author declares no competing financial interests. 
Correspondence should be addressed to the author 
(danesh_moazed@hms.harvard.edu). 


© 2009 Macmillan Publishers Limited. All rights reserved 


Viral and cellular messenger RNA 
targets of viral microRNAs 


Bryan R. Cullen’ 


Given the propensity of viruses to co-opt cellular pathways and activities for their benefit, it is perhaps not 
surprising that several viruses have now been shown to reshape the cellular environment by reprogramming 
the host's RNA-interference machinery. In particular, microRNAs are produced by the various members 

of the herpesvirus family during both the latent stage of the viral life cycle and the lytic (or productive) stage. 
Emerging data suggest that viral microRNAs are particularly important for regulating the transition from 
latent to lytic replication and for attenuating antiviral immune responses. 


MicroRNAs (miRNAs) are a class of small (~21-25 nucleotides) single- 
stranded RNAs that can inhibit the expression of specific messenger 
RNAs by binding to complementary target sequences within the mRNAs. 
Viruses were first reported to express miRNAs in 2004 (ref. 1), when 
Thomas Tuschl and colleagues described five miRNAs that were pro- 
duced in human B cells after infection with the y-herpesvirus Epstein- 
Barr virus (EBV). Subsequently, miRNAs have been found to be expressed 
by all of the herpesviruses examined. EBV is now known to encode at least 
23 miRNAs”’, and the distantly related human y-herpesvirus Kaposi’s 
sarcoma-associated herpesvirus (KSHV) encodes 12 (refs 3-5). Simi- 
larly, the B-herpesvirus human cytomegalovirus (HCMV) encodes 11 
miRNAs”*, and the human a-herpesvirus herpes simplex virus 1 (HSV-1) 
encodes at least 6 miRNAs”*. (These numbers refer to the known pre- 
miRNA precursors encoded by each virus. A pre-miRNA can give rise 
to a single mature miRNA or to two miRNAs, one of which will be more 
abundant.) Several miRNAs have also been identified in herpesviruses 
that infect other species, including the simian y-herpesviruses rhesus 
rhadinovirus’ and rhesus lymphocryptovirus’, murine y-herpesvirus 68 
(MHV68)°, murine cytomegalovirus", and the avian a-herpesviruses 
Marek’s disease virus types 1 and 2 (refs 12, 13). 

Unlike herpesviruses (which are a family of DNA viruses), other, 
unrelated, DNA viruses seem to encode either one or two miRNAs (for 
example, primate polyoma viruses and human adenoviruses) or none 
at all>'*"”, Viruses that have an RNA genome, including retroviruses 
and flaviviruses, have been reported to lack miRNAs”, although this 
result remains somewhat controversial for human immunodeficiency 
virus 1 (HIV-1)'*. The absence of viral miRNAs in the RNA viruses 
examined so far might be partly explained by the fact that if the viral 
genome contained an appropriate precursor, this might be excised by the 
miRNA-processing enzyme of the host cell (that is, by Drosha)”, result- 
ing in degradation of the viral genome. Moreover, most RNA viruses 
— as well as DNA viruses belonging to the poxvirus family — replicate 
in the cytoplasm, away from the nucleus, where the Drosha-containing 
microprocessor complex is located. Therefore, even ifthe genomes of 
these cytoplasmic viruses encode a miRNA, it is not apparent how they 
could be processed to yield a mature miRNA. 

It is less clear why miRNAs seem to be rare in nuclear-replicating 
DNA viruses that are not members of the herpesvirus family. It seems 
possible that the presence of miRNAs in herpesviruses is associated with 
the characteristic ability of herpesviruses to establish long-term latent 


infections. Avoiding the host immune response is particularly important 
during latent infection, and viral miRNAs not only have the advantage 
of not being recognized by the host immune system but also might be 
an ideal tool for attenuating immune responses by downregulating the 
expression of key cellular genes. Moreover, miRNAs might provide a 
way to regulate the entry of herpesviruses to the latent stage of the life 
cycle and/or their exit from this stage”. Other DNA virus families usu- 
ally establish productive infections that often result in the infected cell’s 
dying rapidly as a result of pathogenic factors produced by the virus or 
cytotoxic responses induced in the host. On the one hand, given that 
miRNAs operate at the level of the mRNA, they might not be as use- 
ful during a productive (lytic) replication cycle, because the proteins 
encoded by the targeted mRNAs might have a half-life that approaches, 
or even exceeds, the duration of the viral life cycle. On the other hand, 
viral miRNAs could be effective inhibitors of cellular mRNAs that are 
produced de novo during infection and that might encode proteins with 
antiviral activities. It will be interesting to see whether additional viral 
miRNAs, encoded by DNA viruses other than those of the herpesvirus 
family, will be uncovered in the future. In this Review, I briefly discuss 
what is known about the biogenesis and function of the known viral 
miRNAs, focusing on the limited number of viral and cellular mRNA 
targets that have been identified for these viral miRNAs so far. 


Viral miRNA generation 

The genomic regions encoding cellular miRNAs are generally tran- 
scribed by RNA polymerase II. The initial product is a capped, poly- 
adenylated transcript that includes one or more stem-loop structures, 
each of which contains a mature miRNA sequence as part of one arm 
(Fig. 1). This precursor is known as a primary miRNA (pri-miRNA)” 
(see page 396 for further details about miRNA biogenesis). The nuclease 
Drosha cleaves the pri-miRNA stem, excising hairpin intermediates 
of ~65-70 nucleotides known as precursor miRNAs (pre-miRNAs). 
These are exported to the cytoplasm and processed by another nuclease, 
Dicer, generating mature miRNAs of ~22 nucleotides. The miRNAs 
are loaded into a protein complex known as the RNA-induced silen- 
cing complex (RISC), which they then guide to the target mRNA to 
exert their effector function. Binding of the RISC to an mRNA bearing 
extensive sequence complementarity to the miRNA generally results in 
mRNA cleavage and degradation, whereas binding to mRNAs bearing 
partial complementarity results mainly in translational arrest. 
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At present, there is no evidence that any vertebrate virus encodes 
novel miRNA-processing factors or RISC components. So it seems that, 
in general, viral miRNAs are transcribed and processed in the same 
way as cellular miRNAs and, moreover, that RISCs programmed by 
viral miRNAs are functionally equivalent to those programmed by cel- 
lular miRNAs (Fig. 1). One exception is miRNAs encoded by MHV68 
and by human adenoviruses, because the regions of the viral genome 
encoding these miRNAs are initially transcribed by RNA polymerase III 
(refs 5, 16). Although the processing steps involved in the biogenesis of 
MHV68 miRNAs remain to be fully elucidated, mature MHV68 and 
human adenovirus miRNAs are probably excised by Dicer and loaded 
into the RISC normally. 

A key characteristic of DNA viruses is that gene expression dur- 
ing productive replication is temporally regulated: viral proteins can 
be subdivided into immediate early, early, and late species. Immedi- 
ate early gene products are usually regulatory proteins. Early proteins 
are more diverse and are often involved in viral genome replication or 
in host immune response regulation. Late proteins are usually struc- 
tural. The y-herpesviruses, in particular, also produce viral proteins 
during latency, and these proteins have roles in episome maintenance, 
cell growth regulation and immune evasion. It is important therefore 
to consider whether the expression of viral miRNAs is also temporally 
regulated. For most herpesviruses, this is unclear at present, because the 
known viral miRNAs have been cloned either exclusively from latently 


infected cells or from cells at a relatively late stage in productive repli- 
cation. However, there is evidence indicating that some viral miRNAs 
are active during latency, whereas others are more important during 
productive replication. For example, in the case of KSHV, all 12 viral 
miRNAs are derived from a cluster that is transcribed as a single pri- 
miRNA during latent infection. During the transition to productive 
replication, a viral lytic promoter is activated, and this promoter lies 3’ 
to the genomic sequences encoding ten of the KSHV miRNAs but 5’ to 
the genomic sequences encoding two of them. Consequently, the expres- 
sion of ten of the KSHV miRNAs is largely unaffected when productive 
replication is activated, whereas the expression of two of the miRNAs is 
substantially induced”. 

In the case of HCMV, the 11 known viral miRNAs were all identified 
in productively infected cells®®, and most of these viral miRNAs seem 
to be produced with ‘early’ kinetics (that is, gene expression depends on 
viral immediate early transcription factors). It remains unclear which 
HCMV miRNAs are produced during latency. By contrast, in cells 
infected with HSV-1, four viral miRNAs seem to be expressed mainly 
during latency, one exclusively during productive replication and one 
during both stages”®. Finally, in cells infected with simian virus 40 
(SV40), which is a polyoma virus, viral miRNAs are expressed with 
late kinetics”. A full appreciation of the functions of viral miRNAs will 
certainly require a more detailed understanding of how their expression 
is regulated during the viral life cycle. 
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Figure 1| How DNA virus miRNAs target host and viral mRNAs. After a host 

cell is infected by a DNA virus, the viral genome is transcribed in the nucleus 
to yield both pri-miRNAs and mRNAs. The pri-miRNA is processed by host 

factors in the nucleus to yield the pre-miRNA intermediate, which is then 
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exported to the cytoplasm, where the mature viral miRNA is generated and 
incorporated into the RISC. RISCs that are programmed by viral miRNAs 
in this way can then inhibit expression of viral and/or host mRNAs in the 
infected cell’s cytoplasm. m’G, 7-methylguanosine. 
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Table 1| Viral mRNA targets of viral miRNAs 


Virus Viral miRNA Viral mRNA target Function of viral protein 

EBV miR-BART2 BALFS DNA polymerase 
miR-BART1-5p LMP1 Signalling molecule 
miR-BART16 
miR-BART17-5p 

HvAV miR-1 ORF1 DNA polymerase 

SV40 miR-S1 T antigens Early proteins 

HSV-1 miR-H2-3p ICPO Immediate early protein 
miR-H6 ICP4 Immediate early protein 

HSV-2 miR-| ICP34.5 Pathogenicity factor 

HCMV miR-UL112-1 1E1 (IE72, UL123) Immediate early protein 


EBV, Epstein-Barr virus; HCMV, human cytomegalovirus; HSV, herpes simplex virus; HvAV, Heliothis 
virescens ascovirus; SV4O, simian virus 40. 


Although many viral miRNAs have now been identified, knowledge 
about their functions remains scarce. There are no published reports 
examining the in vivo phenotypes of viral mutants specifically lack- 
ing individual viral miRNAs, and only a small number of mRNA tar- 
gets have been described (Tables 1 and 2). It can be envisaged that viral 
miRNAs evolved to downregulate cellular mRNAs and/or viral mRNAs 
(Fig. 1). Cellular mRNA targets might include transcripts encoding pro- 
teins involved in host innate or adaptive immune responses or, more 
generally, involved in cell-cycle regulation or signal transduction. Viral 
mRNA targets might include transcripts involved in regulating the tran- 
sition from latency to productive replication (or vice versa) or products 
of immediate early genes that need to be eliminated at later stages in the 
viral life cycle as a result of toxicity or because they are targets for host 
cytotoxic T cells'*”°. Although the current understanding is limited, the 
known targets of viral miRNAs have been found to belong to almost all 
of these categories. 


Viral mRNA targets of viral miRNAs 
The first paper to describe viral miRNAs also provided the first indica- 
tion of a viral mRNA target for a viral miRNA. Specifically, one of the 
five EBV miRNAs described by Tuschl and colleagues’, miR-BART2, 
was found to lie antisense to the mRNA encoding the EBV DNA 
polymerase, also called BALF5, and was proposed to inhibit the pro- 
duction of DNA polymerase by inducing cleavage of this mRNA. 
Although there is evidence supporting partial inhibition of EBV DNA 
polymerase expression by miR-BART2 (ref. 22), the functional signifi- 
cance of this inhibition is unknown. However, inhibiting EBV DNA 
polymerase expression might promote entry of the virus to latency by 
reducing viral genome amplification early after infection. Recently, it 
was reported” that amiRNA encoded by the insect DNA virus Helio- 
this virescens ascovirus (HvAV) also downregulates expression of the 
viral DNA polymerase. Unlike miR-BART2, the HVAV miRNA does 
not lie antisense to the viral DNA polymerase mRNA and has only 
moderate homology to the proposed mRNA target, but a reduction 
in the DNA polymerase mRNA level was nevertheless observed. The 
fact that two miRNAs, expressed by two unrelated viral species, both 
reduce the level of mRNAs encoding the cognate viral DNA polymerase 
might indicate convergent evolution. 

Another example of a viral miRNA that is transcribed antisense to 
a viral mRNA, and induces degradation of that mRNA, occurs in the 
polyoma virus SV40. SV40 encodes a single pre-miRNA stem-loop 
structure that is expressed exclusively as a late gene product™. The viral 
miRNAs derived from this stem-loop structure lie antisense to the early 
viral mRNAs that encode the SV40 T antigens, which are viral tran- 
scription factors that induce the expression of late viral genes. These 
SV40 miRNAs, which show perfect complementarity to the T antigen 
mRNAs, induce the cleavage and degradation of the mRNAs and reduce 
T-antigen expression late in the SV40 life cycle. Epitopes derived from 
SV40 T antigens are recognized by cytotoxic T cells, and the effect of 
these viral miRNAs is therefore to partly protect SV 40-infected cells 
from being killed by T cells’. 


Additional cases of viral miRNAs regulating mRNAs to which they 
are antisense have been reported in HSV-1 and the related virus HSV-2 
(refs8,24). During latency, HSV-1 generates aset of five miRNAs: miR-H2-3p 
(3p denoting derivation from the 3’ side of the pre-miRNA stem), miR-H3, 
miR-H4, miR-H5 and miR-H6. One of these miRNAs, miR-H2-3p, lies anti- 
sense to the mRNA encoding the viral immediate early protein ICPO and has 
been shown to downregulate ICP0 production®. Surprisingly, miR-H2-3p 
does not, however, induce ICPO mRNA degradation, despite being fully 
complementary. Although the molecular basis for this phenomenon is 
unclear, other research groups have also reported examples of miRNAs 
or short interfering RNAs (a related class of small RNA) that reduce the 
expression of mRNAs bearing perfectly complementary targets mainly 
by inhibiting their translation™”®. 

In addition to miR-H2-3p lying antisense to ICPO transcripts, HSV-1 
miR-H3 and miR-H4 lie antisense to the mRNAs encoding the patho- 
genicity factor ICP34.5 and, on the basis of genetic data, were proposed 
to inhibit ICP34.5 expression in latently infected neurons’. This hypoth- 
esis has now been validated for the related virus HSV-2, which encodes 
a miRNA similar to miR-H3 (called miR-I)”*. When overexpressed in 
HSV-2-infected cells in culture, miR-I reduces the amount of ICP34.5 
mRNA expressed and the amount of protein produced. A final example 
of an HSV-1 miRNA that targets a viral mRNA is provided by miR-H6, 
which was shown to downregulate production of the HSV-1 protein ICP4 
(ref. 8). This miRNA does not lie antisense to the ICP4 gene in the HSV-1 
genome, but it does show extensive complementarity to ICP4 mRNA, 
including the entire miRNA sequence extending from position 2 to posi- 
tion 8 (the miRNA ‘seed’ region). Full mRNA complementarity to the 
miRNA seed region is generally, but not always, required for inhibition 
of translation”. 

Overall, it seems that four of the six known HSV-1 miRNAs function 
to downregulate viral mRNAs in latently infected cells. The combined 
action of miR-H2-3p and miR-H6, which downregulate the production 
of the HSV-1 immediate early proteins ICP0 and ICP4, respectively, 
might increase the likelihood of HSV-1 entering latency and/or inhibit 
the transition from latency to productive replication®. The inhibition of 
ICP34.5 expression by miR-H3 and, potentially, miR-H4 is more difficult 
to explain, because ICP34.5 is a pathogenicity factor that blocks activa- 
tion of the host antiviral factor PKR (double-stranded-RNA-activated 
protein kinase) and inhibits autophagy (an innate immune response in 
which cells are induced to degrade the bulk of their contents, including 
any newly formed virion particles)”*. Inhibiting ICP34.5 expression 
might shield latently infected neurons from the severe cytopathic effects 
induced by a full-blown HSV-1 productive replication cycle, and this 
idea is supported by the finding that HSV-1 mutants lacking the ICP34.5 
gene are much less neurotoxic”. 

Another example of a viral miRNA that downregulates a crucial viral 
immediate early protein is the HCMV miRNA known as miR-UL112-1, 
which downregulates production of the viral protein IE1 (also known as 
IE72 and UL123) by targeting two partly complementary sites located in 
the 3’ untranslated region (UTR) of JE1 mRNAs”. This observation 
prompted the proposal that herpesviruses in general might use miRNAs 
to regulate the expression of viral proteins that can trigger the tran- 
sition from latency to productive replication®. This hypothesis is far from 
proven, but two observations are consistent with the idea. First, miRNAs 
produced in the latent stage of HSV-1 infection downregulate production 


Table 2 | Cellular mRNA targets of viral miRNAs 


Virus Viral miRNA Host mRNA target Function of host protein 
KSHV miR-K12-11 BACH1 (and others) Transcriptional suppressor 
miR-K12-6-3p THBS1 Adhesion molecule, 
(and others) angiogenesis inhibitor 
HCMV — miR-UL112-1 MICB Natural-killer-cell ligand 
EBV miR-BART5 PUMA Pro-apoptotic factor 
miR-BHRF1-3 = CXCL11 Chemokine, T-cell attractant 


BACH, BTB and CNC homology 1; CXCL71, CXC-chemokine ligand 11; KSHV, Kaposi's sarcoma- 
associated herpesvirus; MICB, major histocompatibility complex class | polypeptide-related 
sequence B; PUMA, p53-upregulated modulator of apoptosis; THBS1, thrombospondin 1. 
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of the immediate early proteins ICP0 and ICP4 (ref. 8), as noted earlier. 
Second, recent data show that viral miRNAs generated during KSHV 
latency downregulate the production of the KSHV immediate early 
proteins Rta and Mta, which are known to have a key role in the acti- 
vation of productive KSHV replication (P. Konstantinova and B.R.C., 
unpublished observations). 

A final example ofa viral gene product that is downregulated by viral 
miRNAs is the EBV protein LMP- 1, which has been reported to be sup- 
pressed by three EBV miRNAs, miR-BART1-5p, miR-BART16 and 
miR-BART17-5p”. LMP-1 is a cytoplasmic signalling molecule that is 
produced during EBV latency and can induce cell growth and transfor- 
mation. However, overexpression of LMP-1 can result in growth inhibi- 
tion and increased apoptosis”. So the role of these miRNAs might be to 
ensure an optimal level of LMP-1 expression during EBV latency. 


Cellular mRNA targets of viral miRNAs 

In principle, viral mRNA targets for viral miRNAs should be easier 
to identify than cellular mRNA targets. If a viral miRNA is antisense 
to a viral mRNA, then this suggests an obvious potential target, 
although not all viral mRNAs lying antisense to a viral miRNA are 
downregulated by that miRNA”. Even if the viral miRNA interacts 
with a partly complementary viral mRNA, this should be an easier 
target to identify than a cellular mRNA, given that viral genomes are 
much smaller than host cell genomes. It could be envisaged that viral 
miRNAs evolved to efficiently degrade host cell mRNAs that encode 
particularly ‘troublesome’ host defence factors; however, no fully 
complementary cellular mRNA targets for viral miRNAs have been 
identified so far. Instead, viral miRNAs seem to inhibit the translation 
of cellular mRNAs bearing partly complementary sites: that is, viral 
miRNAs seem to function just like cellular miRNAs” (Table 2). 

An extreme example of this is the KSHV miRNA miR-K12-11, which 
has a seed region identical to the human cellular miRNA miR-155 
and seems to downregulate an identical, or nearly identical, set of tar- 
get mRNAs*””. The most fully characterized of these is BACH1 (BTB 
and CNC homology 1) mRNA, which contains several targets for both 
miR-K12-11 and miR-155 in its 3’ UTR. BACH 1 is a transcriptional sup- 
pressor, and the significance of this downregulation for KSHV replication 
remains unclear. Even though several human genes downregulated by 
both miR-K12-11 and miR-155 have been identified”, it is unclear why 
miR-K12-11 evolved to phenocopy miR-155. Overexpression of miR-155 
is, however, associated with B-cell transformation, so miR-K12-11 might 
contribute to the transformation of B cells by KSHV”. Interestingly, the 
avian a-herpesvirus Marek’s disease virus type 1 encodes a miRNA that 
also functions as an orthologue of miR-155 (ref. 34), and EBV (although 
it does not itself encode a miR-155 equivalent) induces endogenous 
miR-155 production in infected B cells”. It therefore seems that down- 
regulation of specific cellular genes by either miR-155 itself, or by viral 
orthologues of miR-155, might facilitate the replication of a range of 
different herpesviruses. 

Another cellular gene that is downregulated by KSHV miRNAs is 
thrombospondin 1 (THBS1). THBS1 encodes a protein that is involved 
in facilitating cell-to-cell adhesion and has been reported to have anti- 
proliferative and anti-angiogenic activities”. THBS1 expression is down- 
regulated in Kaposi’s sarcoma tumours, in keeping with the fact that 
tumour survival, particularly in highly vascularized Kaposi's sarcoma 
tumours, requires angiogenesis. Rolf Renne and colleagues** observed 
that THBS1 mRNA was downregulated in cells engineered to produce 
KSHV miRNAs and also showed that translation of THBS1 mRNA is 
inhibited by several KSHV miRNAs, in particular by miR-K12-6-3p, 
which shows miRNA seed-region complementarity to two sites in the 
THBS1 mRNA 3’ UTR. It therefore seems possible that downregulation 
of THBS1 by KSHV miRNAs contributes to the development of Kaposi’s 
sarcoma in vivo. 

An obvious prediction is that viral miRNAs might downregulate cel- 
lular mRNAs encoding antiviral factors, and three such cellular targets 
have been uncovered. First, the HCMV miRNA miR-UL112-1 has been 
reported to target mRNAs encoding MICB (major histocompatibility 
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complex class I polypeptide-related sequence B). MICB is a ligand for 
a cell-surface receptor of natural killer (NK) cells, which are innate 
immune cells that provide one of the early lines of defence against viral 
infection. The MICB-receptor interaction is a key regulator of NK-cell 
activity and hence of NK-cell killing of virus-infected cells”. The pro- 
posed target for miR-UL112-1 in the 3’ UTR of MICB mRNA is unusual 
in that it does not have complete complementarity to the seed region of 
miR-UL112-1, and in this case extensive complementarity to the central 
and 3’ regions of the miRNA might compensate”. Despite this lack of 
complete seed-region complementarity, cells producing miR-UL112-1 
were found to display less cell-surface MICB and to be resistant to NK- 
cell killing in vitro. Conversely, cells infected with a mutant form of 
HCMV lacking miR-UL112-1 had more cell-surface MICB and were 
killed more effectively by NK cells than were cells infected with wild- 
type HCMV. Interestingly, the function of MICB is also inhibited by the 
HCMV protein UL16, suggesting that UL16 and miR-UL112-1 might 
be functioning synergistically to protect infected cells against the NK- 
cell arm of the human immune system”. Recently, it was reported that 
cellular miRNAs, including miR-93, also target the 3’ UTR of the MICB 
mRNA at sites that partly overlap with, but are distinct from, the site 
targeted by miR-UL112-1 (ref. 38). Although these cellular miRNAs are 
not similar in sequence to the viral miRNA, it seems that miR-UL112-1 
is mimicking the function of a subset of cellular miRNAs and thereby 
exerting a similar protective effect against NK-cell killing. As discussed 
earlier, miR-UL112-1 has also been reported to downregulate produc- 
tion of the viral immediate early protein IE1 in HCMV-infected cells 
(Table 1), thus providing the first example ofa viral miRNA that targets 
both viral mRNAs and cellular mRNAs. 

A second example of a viral miRNA that downregulates an antiviral 
factor is EBV miR-BART5, which inhibits production of the pro-apop- 
totic protein PUMA (p53-upregulated modulator of apoptosis)’. Dep- 
letion of miR-BART5 from EBV-infected nasopharyngeal carcinoma 
cells was found to trigger higher levels of PUMA-mediated apoptosis, 
suggesting that miR-BART5 might shield EBV-infected epithelial cells, 
as well as EBV-transformed cells, from elimination by apoptosis. 

The third antiviral gene product known to be downregulated by a 
viral miRNA is CXC-chemokine ligand 11 (CXCL11), an interferon- 
inducible T-cell chemoattractant. CXCL11 mRNA is downregulated by 
EBV miR-BHRF1-3, which is present in large amounts in many EBV- 
induced B-cell tumours”. CXCL11 has also been shown to have anti- 
tumour activity in animal studies, so this finding raises the possibility 
that, by downregulating CXCL11 production, miR-BHRF1-3 might 
shield EBV-infected B cells from cytotoxic T cells in vivo. 


Conservation of viral miRNAs 

The specificity of a miRNA can be altered by changing just one or two 
bases, especially in the seed region”, so the genomic sequences enco- 
ding viral miRNAs might therefore be subject to rapid evolutionary 
drift. But ifthe presence of a particular viral miRNA results in a signifi- 
cant increase in viral replication, then the gene encoding this miRNA 
might be expected to be conserved. Furthermore, if a viral miRNA 
targets a viral mRNA, co-evolution might be expected to occur. By con- 
trast, if a viral miRNA targets a cellular mRNA, then the evolution of 
the viral miRNA gene might be expected to be restricted. 

In fact, analysis of miRNAs encoded by different members of the 
herpesvirus and polyoma virus families has so far uncovered little 
sequence conservation. One exception occurs in EBV and its simian 
relative rhesus lymphocryptovirus: 7 of the 16 miRNAs encoded by rhe- 
sus lymphocryptovirus are markedly similar to EBV miRNAs’. Because 
EBV and rhesus lymphocryptovirus are thought to have diverged 
~13 million years ago, this suggests a strong evolutionary pressure for 
retaining the same miRNA sequences, especially as genomic sequences 
adjacent to the regions encoding the mature miRNAs (for example, 
those encoding the terminal loop of the pre-miRNA) were found to 
have diverged significantly’. 

By contrast, other related viruses (for example KSHV and rhe- 
sus rhadinovirus or Marek’s disease virus types 1 and 2) show no 
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miRNA sequence conservation””, although the genomic location of 
the miRNAs encoded by these viruses is conserved. Conservation 
of genomic location, but lack of sequence similarity, is also observed 
for the simian polyoma virus SV 40 and its human relatives the BK virus 
and JC virus, all of which express miRNAs that are antisense to, and 
degrade, viral T-antigen mRNAs'”. This finding might imply that 
the sole function of the miRNAs expressed by these viruses is to target 
these particular viral mRNAs’. However, viral miRNAs with no known 
viral mRNA targets also tend to be transcribed from the same genomic 
location, even when their nucleotide sequences have diverged”””. So it 
might simply be easier for favourable sequence changes to be selected 
in genomic sequences that encode a pre-existing miRNA stem-loop 
structure than for a novel stem-loop structure to be generated de novo. 
It therefore remains possible that these diverse polyoma virus miRNAs 
also target cellular mRNAs for downregulation. Moreover, the fact that 
two viral miRNAs have divergent sequences does not necessarily imply 
that they have different functions. Two distinct miRNAs, encoded by 
two different viruses, could, for example, target two distinct regions 
in a single mRNA 3’ UTR, or they could target two gene products that 
function at different steps in the same host metabolic pathway, resulting 
in a similar phenotype. Until the mRNA targets for viral miRNAs are 
better understood, and until there is some idea of their in vivo func- 
tions, the conservation (or lack of conservation) of viral miRNAs is not 
readily interpretable. 


Outlook 

Despite our still limited knowledge of viral miRNA functions, the large 
number of miRNAs that are encoded by diverse members of the herpes- 
virus family, and their high-level expression during latent infections, 
suggests that these small non-coding RNAs have a key role in regulating 
viral pathogenesis in vivo. In particular, it will be important to test the 
hypothesis that herpesvirus miRNAs that are produced during latency 
help to maintain the latent state*”’, which could be examined by using 
viral mutants and/or antisense reagents. It certainly seems possible that 
antisense reagents specific for particular viral miRNAs could signifi- 
cantly attenuate herpesvirus-induced diseases in humans, if they could 
be delivered effectively to infected cells in vivo. a 
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The promises and pitfalls of RNA- 
interference-based therapeutics 


Daniela Castanotto' & John J. Rossi’ 


The discovery that gene expression can be controlled by the Watson-Crick base-pairing of small RNAs 

with messenger RNAs containing complementary sequence — a process known as RNA interference — has 
markedly advanced our understanding of eukaryotic gene regulation and function. The ability of short RNA 
sequences to modulate gene expression has provided a powerful tool with which to study gene function and 
is set to revolutionize the treatment of disease. Remarkably, despite being just one decade from its discovery, 
the phenomenon is already being used therapeutically in human clinical trials, and biotechnology companies 
that focus on RNA-interference-based therapeutics are already publicly traded. 


Before 1980, RNA was generally considered to be no more than a pas- 
sive intermediate carrying information between DNA and protein 
synthesis. The discovery of catalytic RNAs in the early 1980s merited a 
shared Nobel prize to Tom Cech and Sidney Altman, and in 1986 the 
concept of ‘the RNA world; an idiom created by Walter Gilbert, was 
proposed. Today, this isa common expression, and RNA has claimed a 
pivotal place in cellular biology. 

Just ten years ago, RNA’s functional repertoire was expanded fur- 
ther with the discovery in the nematode Caenorhabditis elegans' that 
double-stranded RNAs (dsRNAs) can trigger silencing of complemen- 
tary messenger RNA sequences, and the term ‘RNA interference’ 
(RNAi) was born. Shortly thereafter, short dsRNAs — or short inter- 
fering RNAs (siRNAs) (reviewed in ref. 1) — were generated artificially 
and used to demonstrate that this process also occurs in mammalian 
cells, usually, but not always, without triggering the innate immune 
system, which normally recognizes RNAs as part of an antiviral 
defence mechanism (see page 421). The knowledge that small RNAs 
can affect gene expression has had a tremendous impact on basic and 
applied research, and RNAiis currently one of the most promising new 
approaches for disease therapy. 

That RNAi could be triggered in vivo in mammals was first shown in 
animals infected with hepatitis B virus”. This was followed by the first 
therapeutic application of siRNAs: siRNAs were targeted to Fas mRNA 
in a mouse model of autoimmune hepatitis, resulting in protection of 
the treated animals against liver fibrosis’. In 2004, only six years after the 
discovery of RNAi, the first siR NA-based human therapeutics — devel- 
oped as treatments for wet age-related macular degeneration — entered 
phase I clinical trials. RNAi is one of the fastest advancing fields in biol- 
ogy, and the flow of discoveries gives true meaning to the expression 
‘from the bench to the bedside’. 

Although much is known about the mechanisms of RNAi, there are 
a number of challenges that applications of this gene-silencing technol- 
ogy need to overcome. For one, RNAi is a fundamentally important 
regulatory mechanism in the cell, and tapping into it in the interests of 
therapeutic benefit could result in side effects. Exogenously introduced 
dsRNA sequences can sequester components that make up the cellular 
machinery involved in gene silencing (see page 396), thereby reducing 
the accessibility of the machinery to a class of small RNAs known as 
microRNAs (miRNAs) that are entering the natural cellular pathway*”. 


In addition, some synthetic siRNAs contain sequence motifs that can 
induce type I interferon responses and stimulate the production of 
pro-inflammatory cytokines®*. 

During the past few years, many scientists have searched for solu- 
tions to overcome these limitations and to increase the safety of poten- 
tial RNAi-based therapeutics. This article explores recent strategies to 
minimize undesirable secondary effects, describes new approaches 
to delivery and discusses RNAi therapies that are being tested. As it is 
anticipated that this technology will be applied to an increasing range 
of diseases, the potential problems and solutions that could one day 
transform RNAi into a conventional treatment for human diseases 
warrant careful attention. 


Endogenous gene silencing 

The effector RNA molecules of RNAi consist of ~20-30 nucleotides’. 
They are complexed with the protein components of the RNA-induced 
silencing complex (RISC). Its catalytic core in plants and animals (with 
the exception of single-celled organisms) is AGO2, a member of the 
highly conserved Argonaute protein family’®. These small RNAs can 
silence gene expression by two mechanisms: post-transcriptional 
gene silencing (PTGS)"', and transcriptional gene silencing (TGS)'*"? 
(Fig. 1). PTGS can, in turn, be divided into two main mechanisms: 
direct sequence-specific cleavage, and translational repression and RNA 
degradation. Direct sequence-specific cleavage occurs when the tar- 
geted mRNA is perfectly complementary to the siRNA and is degraded 
after site-specific cleavage by the RISC. Translational repression and 
RNA degradation occur when the small RNA guide sequence has only 
limited complementarity to the target in the ‘seed’ region (nucleotides 2 
to 8 from the 5’ end of the guide strand), with base-pairing usually 
occurring in the 3’ untranslated region (UTR). The latter mechanism 
is used by miRNAs. 

TGS has been demonstrated in Schizosaccharomyces pombe (fission 
yeast), plants and, most recently, mammalian cells?!” In S. pombe, the 
process is mediated by the RNA-induced transcriptional silencing com- 
plex (RITS), which contains Agol, the chromodomain protein Chp1 
and the glycine and tryptophan (GW)-repeat-containing protein Tas3 
(ref. 18) (see page 413). Although in mammalian cells the mechanism 
by which small-RNA- directed silencing occurs is still hotly debated, 
both AGO] and AGO2 have been shown to be integral to the overall 
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process'””°. Most recently, a miRNA (miR-320) has been shown to 
regulate transcription of the POLR3D subunit of RNA polymerase III 
(Pol III)” in human cell culture. 

Endogenous small RNAs have been found in various organisms, 
including humans, mice, the fruitfly Drosophila melanogaster and 
C. elegans. Many of these originate from transposons, viruses and 
repetitive sequences and are characterized by their interactions with 
the PIWI subfamily (or PIWI clade) of Argonaute proteins” * — these 
are thus named PIWI-interacting RNAs (piRNAs). The identification of 
piRNAs has been restricted to germline cells. Recently, a new class 
of endogenous siRNAs (endo-siRNAs or esiRNAs) has been identi- 
fied in the gonads and somatic tissues of D. melanogaster” and in 
mouse oocytes*””!. In mice, endo-siRNAs have been proposed to regu- 
late retrotransposon movement™”’. Several families of small RNAs, 
including repeat-associated siRNAs (ra-siRNAs), tiny non-coding 
RNAs (tncRNAs), trans-acting siRNAs (ta-siRNAs) and scan RNAs 
(scnRNAs) (Table 1) are found in fungi, plants and animals, but so far 
none of these has been observed in mammals. The evidence suggests 
that piRNAs act through different cellular pathways from siRNAs 
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and miRNAs and so could offer alternative targeting strategies for 
therapeutic targets. 


Superior designs for small molecules 
Cellular genes can be targeted by exogenous introduction of siRNAs, 
which then take advantage of the endogenous PTGS mechanism. The 
siRNAs can be either transfected into cells, where they enter the RISC 
directly, or generated within cells through gene expression by the use 
of vectors containing Pol I or Pol III promoters. These RNAi trig- 
gers can be expressed in animals and plants, but not in S. pombe, in 
the form of miRNAs or as short hairpin RNAs (shRNAs), which are 
cleaved into small (~21-25-nucleotide) RNAs by the enzymes Drosha 
and/or Dicer. In both cases, if the two strands of the RNA trigger are 
completely complementary, the passenger strand is cleaved by AGO2 
(refs 32, 33), leaving behind a single-stranded guide sequence, which 
acts as the template for recognition of the targeted gene sequence by 
the RISC (Fig. 1). 

Most of the impending therapeutic applications based on RNAi pro- 
pose using direct introduction of synthetic siRNAs. The advantage of 
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Figure 1| Mechanisms of cellular gene silencing. a, Primary microRNAs 
(pri-miRNAs) are, in plants and animals, processed by Drosha and 

its partner DGCR8 into precursor miRNAs (pre-miRNAs) and then 
transported to the cytoplasm by exportin 5 (XPOS). In the cytoplasm, 
they are bound by a Dicer-containing pre-RISC and processed to 

yield the guide sequence that is loaded into the holo-RISC, which 
contains all the components required for gene silencing. AGO2 is the 
catalytic core of the RISC (present but not shown in the schematically 
drawn holo-RISC). The guide sequence binds to the corresponding 
target sequences in the 3’ UTRs of cellular mRNAs. If the miRNA guide 
sequence is fully complementary to its target site (left pathway), it 
triggers site-specific cleavage and degradation of the mRNA through 
the catalytic domain of AGO2. If the base-pairing is incomplete (right 
pathway) but includes pairing of the seed region (nucleotides 2-8 of the 
miRNA) with the target, translational inhibition occurs, and this can 


be accompanied by non-sequence-specific degradation of the mRNA 

in P bodies. b, Similarly to miRNAs, artificially transcribed shRNAs 

(in this case from a plasmid) are transported to the cytoplasm by XPO5. 
The dsRNA in the cytoplasm is recognized and processed by Dicer into 
~21-25-nucleotide siRNA fragments that are loaded into the RISC. The 
siRNAs can target complementary sequences of cellular mRNAs and 
trigger their degradation through AGO2-mediated cleavage. c, When 
siRNAs are present in the nucleus and are complementary to promoter 
regions, they can trigger chromatin remodelling and histone modifications 
that result in transcriptional gene silencing. In mammalian cells, the 
details of this mechanism are still under investigation but are known to 
include Argonaute-family proteins. Accessory proteins indicated in the 
figure are TRBP (HIV tar-RNA-binding protein; also known as TRBP2P) 
and PACT (activator of protein kinase PKR; also known as PRKRA). 
m’G, 7-methylguanosine. 
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Table 1| Cellular small RNAs involved in gene silencing 


Class Size Functions Mechanisms Origin Organisms found in 
(nucleotides) 
siRNAs 21-25 Regulating gene expression, RNA degradation, Intergenic regions, exons, introns | Caenorhabditis elegans, 
providing antiviral response, transposon restriction Drosophila melanogaster, 
restricting transposons Schizosaccharomyces pombe, 
Arabidopsis thaliana, Oryza sativa (rice) 
endo-siRNAs 21-25 Restricting transposons, regulating RNA degradation Transposable elements, D. melanogaster, mammals 
mRNAs and heterochromatin pseudogenes 
miRNAs 21-25 Regulating gene expression at Blocking translation, Intergenic regions, introns C. elegans, D. melanogaster, S. pombe, 
the post-transcriptional level RNA degradation A. thaliana, O. sativa, mammals 
piRNAs 24-31* Regulating germline development Unknown Defective transposon sequences __ C. elegans, D. melanogaster, Danio rerio, 
and integrity, silencing selfish DNA and other repeats mammals 
ra-siRNAs 23-28 Remodelling chromatin, Unknown Repeated sequence elements C. elegans, D. melanogaster, S. pombe, 
transcriptional gene silencing (subset of piRNAs) Trypanosoma brucei, D. rerio, A. thaliana 
ta-siRNAs 21-22 Trans-acting cleavage of RNA degradation Non-coding endogenous D. melanogaster, S. pombe, A. thaliana, 
endogenous mRNAs transcripts O. sativa 
natRNAs 21-22 Regulating gene expression at RNA degradation Convergent partly overlapping A. thaliana 
the post-transcriptional level transcripts 
scnRNAs 26-30 Regulating chromatin structure DNA elimination Meiotic micronuclei Tetrahymena thermophila, 


Paramecium tetraurelia 


tncRNAs 22 Unknown Unknown 


Non-coding regions C. elegans 


*C. elegans piRNAs are 21 nucleotides. endo-siRNAs, endogenous siRNAs; miRNAs, microRNAs; natRNAs, natural antisense transcript siRNAs; piRNAs, PIWI-interacting RNAs; ra-siRNAs, repeat-associated 
siRNAs; scnRNAs, scan RNAs; siRNAs, short interfering RNAs; ta-siRNAs, trans-acting siRNAs; tncRNAs, tiny non-coding RNAs. 


using a chemically synthesized molecule is that chemical modifications 
can be introduced to increase stability, promote efficacy, block binding to 
unintended targets that contain sequence mismatches (specific off-target 
effects), and reduce or abrogate potential immunostimulatory effects 
(general off-target effects). However, the effects of these molecules are 
transient, whereas the promoter-expressed shRNAs or miRNAs can 
potentially mediate long-term silencing with a single application. 

Conventional siRNAs are ~22 nucleotides and have 3’ dinucleotide 
overhangs that mimic Dicer cleavage products. Because not all siRNAs 
achieve equivalent levels of target knockdown, large-scale siRNA screen- 
ing is often performed for any given target to find the most potent inhibi- 
tors. These have yielded some rules for siRNA design. For example, to 
facilitate incorporation into the RISC, the 5’ end of the antisense (guide) 
strand should be designed to have a lower thermodynamic stability than 
the 5’ end of the sense strand. The proportion of the nucleotides guanos- 
ine and cytidine should be around 50% or lower, and targeting of known 
protein-binding sites in mRNA regulatory regions should be avoided 
because binding of regulatory proteins may block siRNA-target pair- 
ing. For the same reason, intramolecular structures in the target should 
be avoided. Statistical analyses have also found a preference for certain 
nucleotides at specific positions within the siRNA*™. Many computer 
programs are available for identifying the optimal target sequences for a 
given gene” +35 One of these, an artificial neural network, has been used 
to develop a genome-wide siRNA library for humans and to identify 
effective siRNAs for 34 targets”. 

Chemical modifications are often included in the design of synthetic 
siRNAs. Selective addition of phosphorothioate linkages or substitution 
of 2’ fluoropyrimidines or a 2’-O-methy] for the 2’ ribose at certain pos- 
itions does not compromise siRNA activity and concomitantly increases 
resistance to ribonucleases”, which is important for in vivo applications. 
A single 2'-O-methyl group on the passenger strand ofan siRNA duplex 
can abrogate activation of the Toll-like receptors* and prevent toxicities 
due to the activation of type | interferon pathway gene expression. It has 
recently been demonstrated that fluoro-B-p-arabinonucleic acid (FANA”™ 
or as 4’-S-FANA”) or arabinonucleic acid (ANA“’) modifications can 
increase both the serum stability and the potency of siRNAs. Some chem- 
ical modifications also have the important advantage of decreasing or 
blocking the activity of the siRNAs sense (passenger) strand, thereby 
reducing specific off-target effects. Other modifications, such as the add- 
ition of lauric acid, lithocholic acids and cholesterol derivatives, can be 
made to increase cellular uptake”, which is currently one of the main 
hurdles of RNAi therapy. 
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Breaking and entering 

Therapeutic applications of siRNAs require effective delivery to the tar- 
get cells and tissues. The two main strategies are delivery of chemically 
synthesized siRNAs (non-viral delivery), or delivery of shkRNA-encoding 
genes by engineered viruses that will ultimately generate siRNAs by 
transcription in the target cells. 


Non-viral delivery 

Because of their size and negative charge, siRNAs cannot easily cross 
cell membranes. Delivery has therefore been one of the major challenges 
for RNAi technology. Various means of delivery have been developed 
and tested in murine and non-human primate models, ranging from 
the injection of naked RNAs into a target organ such as the lung or eye 
to systemic delivery of the RNA in nanoparticles, complexed with poly- 
cations, attached to cholesterol groups or conjugated with cell-surface 
receptors. Some delivery approaches are detailed in Fig. 2. 

Two polymers that have been examined for their delivery properties 
are atelocollagen and chitosan. Chitosans have mucoadhesive properties 
and have been used for intranasal delivery to bronchiolar epithelial cells”. 
Intranasal delivery has proved an effective means of delivering siRNA in 
mice“ and in non-human primates” to block respiratory syncytial virus 
infection of the upper respiratory tract. In fact, the delivery of siRNAs to 
mucosal membranes seems to be an effective approach in general. For 
example, intravaginal delivery of lipid-encapsulated siRNAs targeting 
herpes simplex virus 2 (HSV-2) provided protection against lethal viral 
infection in more than two-thirds of the siRNA-treated mice”. 

Targeting ofanti-apolipoprotein B (APOB) and peroxisome proliferator- 
activated receptor-a (PPAR-a) siRNAs to the liver has been achieved 
by means of a ‘membrane-active’ polymer, which can mask its activ- 
ity until it reaches the endosome, resulting in the delivery of siRNAs 
to hepatocytes after a simple intravenous injection”. A different siRNA 
delivery approach used transferrin conjugated to a cyclodextrin-poly- 
cation polymer to deliver siRNAs targeting the Ewing's sarcoma Ews-Flil 
fusion mRNA by means of the transferrin receptor in mice“, resulting 
in inhibition of tumour progression. And conjugation of an siRNA to 
a cholesterol group permitted its delivery to the liver and the jejunum, 
where it downregulated its target, APOB, leading to consequent lowering 
of blood cholesterol levels ina murine model system”. 

An important advance for siRNA delivery was the successful applic- 
ation of stable nucleic-acid lipid particles decorated with polyethylene 
glycol (PEG) polymer chains (termed SNALPs) for the delivery of siRNAs 
directed against APOB mRNA (APOB-targeted siRNAs) to the livers of 
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non-human primates”. In this case, the siRNA effect ofa single intra- 
venous injection lasted for more than 11 days and resulted in greater 
than 90% target knockdown and no toxicity”’. These exciting results 
have increased confidence in the potential of therapeutic siRNAs for 
treating liver diseases. 

Until recently, most approaches to in vivo delivery have targeted a 
particular organ, primarily the eye, the lungs or the liver. A significant 
advance in targeting siRNAs to a specific class of leukocytes involved 
in gut inflammation has now been reported”. In this study, a cyclin D1 
(Cyd1)-targeted siRNA was loaded into stabilized nanoparticles, the sur- 
faces of which incorporated an antibody specific for a receptor expressed 
by the leukocytes. The targeted siRNA-containing nanoparticles down- 
regulated the cyclin D1 target, suppressed leukocyte proliferation and 
reversed experimentally induced colitis in mice”. 

Delivery of siRNAs to the nervous system has been particularly chal- 
lenging. The brain is notoriously refractory to targeting because of dif- 
ficulties in crossing the blood-brain barrier. However, delivery of siRNAs 
to the peripheral nervous system by direct infusion into the brain for 
the relief of chronic pain” ™ or anxiety” has been demonstrated in rats. 
Conjugates of liposomes and antibodies or neuropeptides have also been 


a_ Cholesterol conjugates 


Cholesterol 


¢ SNALPs 


Figure 2 | In vivo delivery strategies for therapeutic siRNAs. a, Cholesterol 
groups can be linked to modified siRNAs to enhance their stability 

before systemic delivery. The most common siRNA modifications are 
2'-O-methyluridine or 2’-fluorouridine substitutions (blue circles) 
combined with phosphorothioate linkages. b, Polycation nanoparticles 

can direct delivery of the siRNAs to specific cells through the use of surface 
ligands (such as transferrin) that bind to receptors on target cells. c, SNALPs 
encapsulate modified siRNAs into cationic or neutral lipid bilayers coated 
with diffusible PEG-lipid conjugates. SNALPs allow siRNAs to be taken 
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used to deliver siRNAs into the murine brain®. Nevertheless, these meth- 
ods do not target neurons, and a less invasive alternative to direct cranial 
injection is required to make such therapies more palatable. 

A recent study unlocked the possibility of selective delivery of siRNAs 
to the central nervous system by systemic intravenous injection”. The 
siRNA involved — designed to target Japanese encephalitis virus — was 
conjugated with a short peptide derived from the rabies virus glyco- 
protein, which binds to the neuronal cell acetylcholine receptor. After 
transvascular delivery, 80% of the mice treated with the therapeutic siRNA 
survived infection with Japanese encephalitis virus, whereas 100% of the 
untreated controls died from complications of the infection”. 

Another interesting approach that allows systemic and targeted siRNA 
delivery uses a protamine-antibody fusion protein”. The protamine moi- 
ety is linked to the heavy-chain antigen-binding region (Fab) of an anti- 
body to the human immunodeficiency virus 1 (HIV-1) envelope protein 
gp160. The positively charged protamine binds the negatively charged 
siRNAs — which are targeted against the HIV gene gag — allowing selec- 
tive delivery to cells expressing the gp160 envelope protein on their sur- 
faces”®. This results in internalization of the antibody—siRNA complex, 
release of the siRNAs and downregulation of the HIV Gag-encoding 


d= MEA-dynamic polyconjugate particles 


Ligand 


up by cells and released by endosomes. d, Masked endosomolytic agent 
(MEA)-Dynamic PolyConjugates (DPCs) are similar to SNALPS but 
smaller, and contain a ligand that allows targeted cell delivery. The release 
of the siRNA from the endosome is also improved by the inclusion of a 
pH-labile bond in the MEA-DPC particles. e, Tagging specific antibodies 
with protamine or other positive charges allows the delivery of siRNAs to 
specific cell types via receptor-mediated uptake. f, Chemically linking or 
co-transcribing siRNAs with RNA aptamers allows the targeted delivery 
of the siRNAs to cells expressing the appropriate receptor. 


429 


© 2009 Macmillan Publishers Limited. All rights reserved 


Table 2 | Current clinical trials of RNAi-based therapeutics 


siRNA Company Disease Stage 
Bevasiranib Acuity Pharmaceuticals Wet age-related macular degeneration Phase Ill 
Diabetic macular oedema Phase II 
Sirna-O27 Merck-Sirna Therapeutics Wet age-related macular degeneration Phase Il 
RTP801i-14 Quark Pharmaceuticals, and Silence Therapeutics Wet age-related macular degeneration Phase IZIIA 
ALN-RSVO1 Alnylam Pharmaceuticals Respiratory syncytial virus infection Phase ll 
NUC B1000 Nucleonics Hepatitis B Phase | 
Anti-tat/rev shRNA City of Hope National Medical Center, and Benitec AIDS Pilot feasibility study 
CALAA-O1 Calando Pharmaceuticals Solid tumours Phase 
TD101 TransDerm, and the International Pachyonychia Pachyonychia congenita Phase 


Congenita Consortium 


transcripts in a murine model in vivo. In this same study, fusion to pro- 
tamine of an antibody specific for the hormone receptor ERBB2 allowed 
siRNA targeting of cancer cells expressing that receptor™. 

A similar technology for specific targeted delivery is based on 
aptamer-RNAi chimaeras”. Aptamers are in vitro-evolved, synthetically 
prepared nucleic acids that selectively bind specific ligands. An RNA 
aptamer designed to bind prostate-specific membrane antigen (PSMA; 
also known as FOLH 1) was linked to a PLK1-targeted siRNA, and bind- 
ing of the aptamer to the PSMA receptor resulted in the selective delivery 
into prostate cancer cells of siRNAs that target pro-survival genes”. 
Intratumoral injection of the PSMA-Plk1-targeted siRNA or PSMA- 
Bcl2-targeted siRNA conjugates into a mouse xenograft model resulted 
in triggering of apoptosis, growth inhibition and tumour regression”. 

A different conjugation of an siRNA to vitamin-A-coupled liposomes 
succeeded in delivering antifibrotic siRNAs to hepatic stellate cells, 
which are produced in response to liver damage”. In this study, mul- 
tiple siRNA treatments targeting collagen chaperone-encoding genes 
reversed liver fibrosis by preventing collagen deposition and increased 
survival in rats, providing a potential therapeutic approach to treating 
liver cirrhosis. 

Also noteworthy is the recent report of libraries of lipid-like molecules 
(lipidoids) that can be selected for siRNA delivery to various tissues”. 


Viral delivery 

An alternative means of triggering RNAi is through promoter-expressed 
siRNA sequences processed from shRNAs or miRNA mimics. The 
genes encoding these hairpin structures are most commonly inserted 
into the backbones of viral vectors under the control of Pol II or Pol III 
promoters. A potential advantage of vector delivery is that a single 
administration triggers long-term expression of the therapeutic RNAi. 
This is particularly appropriate for chronic viral diseases such as HIV 
and viral hepatitis. 

Lentiviral vectors have been used successfully to deliver shRNA con- 
structs in various mammalian systems. For example, it was shown that 
downregulation of an activated Ras oncogene by a lentiviral-delivered 
shRNA resulted in suppression of tumour growth in mice™. And down- 
regulation of the expression of a mutant form of superoxide dismutase 1 
(SOD1) in mouse models of amyotrophic lateral sclerosis delayed the 
onset of disease™®. More recently, a lentiviral vector was used to deliver 
a Smad3-targeted shRNA for regeneration of satellite cells and repair 
of old tissue in aged and injured muscle®. Viral-vector expression of 
shRNAs has also been explored in mouse models of neurodegenerative 
disorders such as Huntington's disease and Alzheimer’s disease”. 

To deliver genes to the central nervous system, adenoviral vectors 
have proved very useful. For instance, direct brain injection of an adeno- 
viral vector expressing a shRNA directed against the mRNA encoding 
the polyQ-harbouring SCA 1-encoding transcript of spinocerebellar 
ataxia type 1 was shown to be an effective treatment in a mouse model 
of this disorder®. 

Despite the successes of viral delivery, it is important to bear in mind 
that although some viruses are non-pathogenic, they are still potentially 
immunogenic. Another major concern with this technique is the risk 
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of incurring mutations in viral sequences, causing insertional muta- 
genesis or triggering aberrant gene expression. However, viral vectors 
can transduce both dividing and non-dividing cells, yield a prolonged 
expression of the therapeutic gene and need not be delivered in large 
doses. Ultimately, any therapeutic gene when expressed in large quan- 
tities has the potential to cause toxicity and immunogenicity. Critical 
parameters such as tolerability, long-term expression, efficacy and the 
ability to regulate expression and targeting should be taken into con- 
sideration when choosing a delivery method. There is no ideal delivery 
system for every application; rather, the delivery method needs to be 
tailored to the application. 


Clinical trials using RNAi to treat human diseases 

For a new technology, siRNAs have moved into the clinic at an unpre- 
cedented pace. Some examples of the diseases and siRNA-targeting 
strategies that are currently under investigation are described below. 

The first siRNA protocol granted investigational new drug (IND) 
status and tested in a human clinical trial is the vascular endothelial 
growth factor (VEGF)-targeted siRNA Bevasiranib (Acuity Pharmaceu- 
ticals, Philadelphia, Pennsylvania) for the treatment of wet age-related 
macular degeneration (see Table 2 for a summary of ongoing siRNA 
clinical trials). This involves the overgrowth of blood vessels behind the 
retina, and causes severe and irreversible loss of vision; it affects 1.6 mil- 
lion people in the United States alone, and it is predicted that 11 million 
individuals worldwide will have the disease by 2013. Preclinical studies 
of Bevasiranib in mice showed reduced neovascularization resulting 
from downregulation of Vegf expression after direct ocular injection 
of the siRNA”. This siRNA, which is nowina phase III trial, is also in 
a phase II clinical trial for the treatment of diabetic macular oedema. 
By the conclusion of these trials, several hundred patients will have 
received the siRNA treatments. 

Two other companies are also focusing on siRNA-based treatments 
against macular degeneration: Merck's Sirna Therapeutics (San Francisco, 
California) with an siRNA (Sirna-027) that targets the VEGF receptor 
VEGFRI, and Quark Pharmaceuticals (Fremont, California) in col- 
laboration with Silence Therapeutics (London and Berlin; previously 
SR Pharma), with one targeted against a hypoxia-inducible gene, RTP801 
(also known as DDIT4), that is known to be involved in disease progres- 
sion. This siRNA, RTP801i-14, has been licensed to Pfizer, UK, which is 
now runninga phase I/TIA clinical trial. Quark Pharmaceuticals has also 
received IND status for another preclinical trial, in which it is currently 
enrolling patients. This trial is for an siRNA targeting T'P53 mRNA (which 
encodes the protein p53), inhibition of which delays the induction of cell- 
death pathways and thereby reduces acute kidney injury after surgery. 

Calando Pharmaceuticals (Pasadena, California), meanwhile, has ini- 
tiated a phase I clinical trial for solid tumours using an siRNA that targets 
a subunit of ribonucleotide reductase (RRM2), an enzyme required for 
the synthesis of DNA building blocks. Importantly, this trial is the first to 
utilize receptor-mediated delivery of siRNAs, which are encapsulated in 
cyclodextrin particles decorated with transferrin. This results in uptake 
by cells expressing the transferrin receptor, which is highly expressed 
on cancer cell surfaces. 
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The clinical trials performed by Acuity Pharmaceuticals and Merck’s 
Sirna Therapeutics successfully stabilized patients’ conditions against 
further degeneration and improved their vision without adverse effects. 
These results engendered great optimism for intravitreal injection of 
siRNAs, but in a stunning turn of events a report by Kleinman et al. 
demonstrated that the observed decrease in vascularization could be 
a consequence not of an siRNA-specific effect on angiogenesis, but 
rather a nonspecific activation of Toll-like receptor 3 (TLR3) and sub- 
sequent activation of interferon-y and interleukin 12, which, in turn, 
downregulate VEGF”. In other words, both the targeted and the con- 
trol siRNAs mediated nonspecific inhibition of angiogenesis through 
a direct interaction of the siRNAs with TLR3. Cellular uptake is not 
necessary for this effect, and because TLR3 is involved in several other 
cellular pathways the finding has highlighted another level of concern 
for safe clinical use of siRNAs. 

Alnylam Pharmaceuticals (Cambridge, Massachusetts) is a well-estab- 
lished siRNA-therapeutics company whose leading candidate siRNA, 
ALN-RSV01, is now in a phase II clinical trial. This siRNA targets res- 
piratory syncytial virus — which affects almost 300,000 people every 
year in the United States alone — by silencing the virus’s nucleocapsid 
‘N’ gene, a gene essential to viral replication. ALN-RSV01 was the first 
antiviral siRNA to enter clinical trials, and trials will soon be expanded 
to paediatric patients. Thus far it has been shown to be effective and well 
tolerated. Recently, Alnylam Pharmaceuticals formed an exclusive alliance 
with Kyowa Hakko Kogyo to develop and commercialize ALN-RSV01 in 
Japan and other Asian countries. 

Also in development at Alnylam Pharmaceuticals are siRNAs directed 
against genes implicated in hypercholesterolaemia, Huntington's disease 
(in a joint venture with Medtronic of Minneapolis, Minnesota), hepa- 
titis C (in a joint venture with Isis Pharmaceuticals in Carlsbad, Cali- 
fornia), progressive multifocal leukoencephalopathy (in a joint venture 
with Biogen Idec of Cambridge, Massachusetts) and pandemic flu (ina 
joint venture with the Swiss company Novartis). 

The International Pachyonychia Congenita Consortium (IPCC), in 
collaboration with TransDerm (Santa Cruz, California), has developed 
an siRNA to allow the correct production of keratin as a treatment for a 
rare skin disorder called pachyonychia congenita. 

The City of Hope National Medical Center in Duarte, California, in 
collaboration with Benitec (Melbourne, Australia), has started a phase I 
trial for the treatment of AIDS lymphoma. This trial uses a Pol III pro- 
moter-expressed shRNA targeting the HIV tat and rev shared exons. 
The shRNA has been incorporated into an HIV-based lentiviral vector, 
which in turn has been used to insert the shRNA gene (along with two 
other RNA-based anti-HIV genes) into blood stem cells”. The gene- 
modified stem cells have been infused into HIV-positive patients in a 
trial that uses autologous bone marrow transplantation to treat AIDS- 
related lymphomas. Four patients have now been treated in this trial. 

As indicated above, partnerships have become quite accepted in the 
field of siRNA biotechnology. These consortia are considerably increas- 
ing the capital available for these efforts and are shortening the time 
involved in commercializing siRNA-based drugs. 

Some companies, such as Regulus Therapeutics (Carlsbad, Califor- 
nia), have chosen to focus on miRNAs as therapeutic targets. Santaris 
Pharma in Horsholm, Denmark, has recently started the first phase I trial 
to target ahuman miRNA (miR-122). In this trial, miR-122 is being tar- 
geted for downregulation with a locked nucleic acid (LNA) anti-miRNA 
(SPC3649). LNA is a backbone modification that enhances the hybridiz- 
ation of the oligonucleotide with its target and protects it from nuclease 
degradation. The approach is intended to treat hepatitis C virus infec- 
tion because miR-122 facilitates replication of this virus in the liver”””’. 
Downregulation of miR-122 is also potentially useful in the treatment of 
hypercholesterolaemia. Targeting miRNAs expressed in the heart, such as 
miR-208, which regulates cardiac hypertrophy and fibrosis”, may have an 
advantage, because in the medical field there is a considerable experience 
in delivering drugs directly into this organ. 

Gain or loss of miRNA function has been linked to the onset and pro- 
gression of various diseases’””’. Protein function can be regulated either 


directly or indirectly by miRNAs, and alterations in miRNA expression 
can have profound effects on gene regulation. In instances in which 
disease results from altered miRNA expression, it is conceivable that 
normal levels could be achieved, either by targeting the specific miRNA 
if expression is too high or by delivering a miRNA mimic if expression is 
too low. However, the specificity and efficacy of delivery systems would 
need to be improved for this goal to be accomplished. Moreover, correct 
modulation of the targeted miRNA’ expression is not an easy task, and 
it is not clear whether one miRNA can be specifically targeted without 
affecting other miRNAs of the same family. 

The regulatory complexities of miRNAs should also be taken into 
consideration when either ablation or restoration of miRNA function is 
being considered in a therapeutic setting. A single miRNA can regulate 
the levels of hundreds of proteins”, raising cautionary flags about the 
consequences of downregulating or ectopically expressing even a single 
miRNA species. 


The safety issue 

The application of siRNAs to therapeutics has raised a number of con- 
cerns about their safety. After the initial excitement, a number of reports 
underscored potential drawbacks to this promising technology. The 
first warning came from a study that recorded the deaths of mice after 
Pol III promoter-driven expression of shRNAs in the liver*. The exact 
mechanisms leading to mortality are still under investigation, but seem 
to be due at least in part to saturation of the transport factor, expor- 
tin 5, that ferries miRNAs from the nucleus to the cytoplasm. There 
are now indications that other factors involved in the RNAi process 
can also be saturated by high-level expression of exogenous siRNAs, 
which can sequester them from their cognate cellular miRNAs. Because 
each cellular miRNA can potentially modulate the expression of several 
hundred genes””, minor alterations in the miRNA pathway can have 
major consequences. 

One strategy to mitigate this problem is to use the lowest possible con- 
centration of siRNAs that provides therapeutic efficacy by designing the 
exogenous siRNAs to be Dicer substrates (by increasing their length). 
These RNAs enter the RNAi pathway upstream of the RISC at the step of 
Dicer cleavage, which facilitates passing the siRNA to AGO2 for selection 
of the guide strand, often resulting in enhanced RNAi at lower concen- 
trations than can be achieved with the exogenous delivery of cognate 
21-base siRNAs”. Although small amounts of siRNAs are not expected 
to saturate the RNAi machinery, they can compete with miRNAs for 
selective incorporation into the RISC°. The long-term consequences of 
such competition are poorly understood. 

With the use of microarrays, it has become increasingly obvious that 
introducing foreign siRNAs into the cell alters the expression of non- 
target genes, as well as target genes***>; as few as six or seven nucleotides 
complementary to the seed region could result in a specific off-target 
effect®* through a miRNA-like mechanism. Because microarrays only 
reflect mRNA levels, they do not take into account any genes affected at 
the translational level, and so at present it is not clear how extensive the 
problem of off-target effects really is. Given that the application of syn- 
thetic siRNAs results in transient inhibition of gene expression, specific 
off-targeting may not be a major concern for many clinical applications. 
Nevertheless, appropriate toxicity testing should take into account the 
potential for a particular siRNA to target 3’ UTRs in non-target genes. 

Some strategies can be used in siRNA design to minimize the problem 
of off-targeting. For instance, it has been shown that 2’-O-Me modifica- 
tions” or DNA substitutions® in siRNA duplexes can significantly reduce 
off-target effects. It would also be valuable to improve antisense-strand 
selectivity by taking into account thermodynamic stability (see ‘Superior 
designs for small molecules’) or by blocking the 5’ phosphorylation of 
the sense strand”. 

RNAi is a widely conserved mechanism that may originally have 
evolved to combat viral infections. As such, it is perhaps not surprising 
that in some cases siRNAs can act as agonists of Toll-like receptors” and 
that specific sequence motifs, such as uridine-rich regions and guanos- 
ine- and uridine-rich regions, can induce cellular immune responses”, 
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The ability of an siRNA to stimulate cellular immune responses is based 
not only on specific sequences but also on structure, the type of deliv- 


ery system used and the cell type”. Although the immunostimulatory 


potential of siRNAs could be advantageous in certain circumstances”, 
it is usually an unwanted outcome. The above-mentioned finding of 
the TLR3 response to non-sequence-specific modulation of VEGF or 
the VEGF receptor”, as well as a separate report showing that a macro- 
phage migration inhibitory factor (Mif)-targeted siRNA (in a murine 
model) anda nonspecific control siRNA increased the proliferation of 
breast cancer cells through activation of dsRNA-activated protein kinase 
(PKR)*, raise serious concerns in interpreting the results of in vivo 
siRNA applications. 

Although we have yet to reach a universal solution for avoiding all off- 
target effects, it is foreseeable that these problems will be overcome by the 
use of appropriate backbone modifications, as well as delivery systems 
that can mask RNAs from the receptors of the innate immune system™. 


Gazing ahead 

Despite the technique’s youth, the list of diseases for which RNAi is 
being tested as a therapeutic agent is extensive, and includes Parkinson's 
disease, Lou Gehrig's disease, HIV infection, wet age-related macular 
degeneration, type 2 diabetes, obesity, hypercholesterolaemia, rheuma- 
toid arthritis, respiratory diseases and cancers. It is already a multimillion 
dollar business, projected to reach US$1 billion by 2010, and intellectual 
property rights will become an increasingly important concern in the 
coming years. 

However, although much has been accomplished, obstacles remain 
that will hamper the race to the clinic. The ultimate goal of achieving 
RNAi-based therapies for life-threatening or debilitating diseases cannot 
be attained without improving the safety, effectiveness and reliability of 
RNAi-trigger delivery systems. The use of targeted delivery strategies that 
permit systemic delivery will be a big step towards fulfilling this difficult 
task. The development of new, noninvasive imaging methods to monitor 
the in vivo delivery of siRNAs, such as labelling with near-infrared dyes”, 
will aid studies of tissue uptake and biodistribution. 

Although RNAi is not yet an accepted therapeutic modality, the enor- 
mous interest in this phenomenon ensures that we will soon witness fast 
advances and new applications for RNAi-based therapies. Given the 
way that RNAi has transformed basic research and the unprecedented 
speed with which it has reached the clinic, the coming years promise to 
be exciting. a 
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Changes in the phase of the annual cycle of 
surface temperature 


A. R. Stine’, P. Huybers” & |. Y. Fung? 


The annual cycle in the Earth's surface temperature is extremely large—comparable in magnitude to the glacial-interglacial 
cycles over most of the planet. Trends in the phase and the amplitude of the annual cycle have been observed, but the causes 
and significance of these changes remain poorly understood—in part because we lack an understanding of the natural 

variability. Here we show that the phase of the annual cycle of surface temperature over extratropical land shifted towards 
earlier seasons by 1.7 days between 1954 and 2007; this change is highly anomalous with respect to earlier variations, which 
we interpret as being indicative of the natural range. Significant changes in the amplitude of the annual cycle are also 

observed between 1954 and 2007. These shifts in the annual cycles appear to be related, in part, to changes in the northern 
annular mode of climate variability, although the land phase shift is significantly larger than that predicted by trends in the 


northern annular mode alone. Few of the climate models presented by the Intergovernmental Panel on Climate Change 
reproduce the observed decrease in amplitude and none reproduce the shift towards earlier seasons. 


Climate change is often described by trends in annual mean temper- 
ature, but large seasonal temperature changes exist independent of 
changes in the annual mean. A small literature exists concerning the 
variability in the phase of the annual cycle. Thomson' examined the 
Central England Temperature time series (1659-1990), and identified 
a trend in the phase of the annual cycle towards later seasons, beginning 
around 1950, that is anomalously large in the context of the preceding 
several-hundred-year record. He argued that this excursion is assoc- 
iated with increases in atmospheric CO, concentration. He also 
presented evidence of trends in the phase of the annual cycle over larger 
spatial scales and an increase in the spatial variance of the trends. Mann 
and Park’ and Wallace and Osborn’ demonstrated that the hemi- 
spheric averaged observations contain trends in amplitude and phase. 
The amplitude trend is negative and is related to the observation that 
winter is, on average, warming more quickly than summer*®. The 
hemispheric phase trend, however, is towards earlier seasons, opposite 
in direction to that found by Thomson! for central England. 

The importance of these observed amplitude and phase trends is 
hard to judge because we lack a good model for natural variability. 
Wallace and Osborn’ used two criteria to evaluate whether the 
observed trends are unusual: (1) a statistical test for the presence of 
a trend and (2) a comparison of trends with natural variability as 
represented in a general circulation model. Neither of these 
approaches is altogether satisfactory. We expect low-frequency 
variability always to be present, so the presence of a trend in and of 
itself is not surprising’. Furthermore, general circulation models may 
not give us an accurate picture of low-frequency variability, particu- 
larly in phase, because of two shortcomings. First, the models that 
have been used to evaluate phase and amplitude variability have used 
seasonal heat and freshwater flux adjustments to match the mean 
annual cycle, which may artificially stabilize the modelled annual 
cycle. Second, and more troubling, Northern Hemisphere phase 
trends predicted by models forced with twentieth-century anthro- 
pogenic forcing are in the opposite direction to the observed trend””. 
Modelled Northern Hemisphere amplitude trends also disagree with 
observations when compared using a temporally fixed network’. 


Models that are unable to replicate observed trends are clearly not 
ideal for constraining the range of natural variability. Instead, we 
appeal to the early observational record to estimate the natural spatial 
and temporal variability of the seasonal cycle and ask if the trends 
observed in the recent record are anomalous in nature. 


The basic state of the annual cycle 

Two distinct temperature-based methods for discussing the timing of 
the seasons have been used in the literature. The more common is a 
threshold-based model wherein seasonal transitions are defined as the 
times of year when the temperature rises above or drops below some 
specific value. In this framework, the ‘spring’ threshold will be reached 
earlier if temperature increases uniformly through the year. This type 
of change is of first-order importance for explaining changes in 
seasonality observed both in biological systems (for example flowering 
dates*”, bird migration timing®*’ and terrestrial surface carbon 
uptake’’) and in components of climate that exhibit threshold res- 
ponses (for example the freezing and melting of ice''). However, 
threshold-based definitions conflate changes in the phase of the 
annual cycle with changes in the annual mean (see Supplementary 
Information). We instead describe the seasonal cycle by the amplitude 
and phase of the yearly-period sinusoidal component in surface tem- 
perature, a measure of seasonality that is distinct from changes in the 
annual mean'*'’, and reference it to the yearly-period sinusoidal 
component in local solar insolation. The difference between the tem- 
perature and local insolation phases (A = $7 — gyn) is the lag’’, and 
the ratio of the amplitudes (G = Ay/A,y,) is the gain (see Methods 
Summary). We examine gridded 5° X 5° temperature records from 
the University of East Anglia’s Climate Research Unit'*”’ and analyse 
long-term mean, detrended variability and trend fields for land (Ajana 
Giana) and ocean (Agcean» Gocean)- 

The spatial patterns of A and G (Fig. la, b) are dominated by the 
contrast between land and ocean. The larger ocean thermal mass causes 
it to respond more sluggishly to oscillatory forcing than land, which 
results in a smaller and later oceanic response. Ocean points have a 
mean gain of 28°C (kW m *) ' (standard deviation of point-wise 


'Department of Earth and Planetary Science, University of California, Berkeley, California, 94720, USA. 7Department of Earth and Planetary Sciences, Harvard University, Cambridge, 


Massachusetts, 02138, USA. 
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Figure 1| Lag and gain fields. a, c, e, Phase lag, A; b, d, f, amplitude gain, G. 
We plot long-term-mean value (a, b), temporal standard deviation of the 
detrended time series (c, d); and trend in days per 54 years (e) and 
°C(kWm ”)! per 54 years (f). Both variability and trend maps are plotted 
on the ‘dense network’ (1954-2007), without land and ocean masks applied. 
Results have been excluded in the tropics, where data availability is poor, and 
where less than 85% of the variance in an average year is explained by the 
yearly component. 


means, ¢,, = 15°C (kW m ”)~’) and a mean lag of 56 days (o, = 11 
days); in comparison, the more rapidly adjusting land has a mean gain 
of 74°C (kWm 7)! (6, = 23°C (kW m ’) ') anda mean lag of 29 
days (a, = 6 days). 

Superimposed on the dominant land—ocean contrast is an east— 
west gradient in G and J. As we move from west to east (following the 
prevailing winds) across the mid-latitude continents, there is a tend- 
ency towards a more rapid response (large Giang) small Aang). This 
cross-continent gradient in Gjana is quite strong, whereas the gradient 
in Ajana is relatively weak (Ajanq adjusts rapidly to interior values along 
the western continental margin'”). Conversely, as we move from west 
to east across the mid-latitude ocean basins, there is a tendency 
towards a more sluggish response (small Goceans large Aocean)s and 
the relative strengths of the Gocean and Acgcean gradients is reversed 
relative to that of the land (G,ccan adjusts rapidly to interior values 
along the western margin of ocean basins). 

The role of land-sea contrast in setting the climatological distri- 
bution of the annual cycle is not a new observation’””’”’, but its 
dominance is particularly obvious when considering the relationship 
between Gand J. Pairs of G and / fall along an arc (Fig. 2a). We define 
a ‘seasonal response index’ to represent a point’s position in this lag— 
gain space as 


2— min (A) 


—  G—min(Y) 
a max (G)— min (Q) 


max (A)— min (A) 
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and find that 75% of the variance in this index is explained by the 
distance between a grid point and the coast to its west, where distance 
is taken as positive for land and negative for ocean (see 
Supplementary Information for more discussion on the structure 
of variability). The relationship between SRI and distance from the 
coast holds best in Eurasia, where southern mountains constrain the 
transport to be zonal and isolate the interior from tropical moisture. 
Deviations from this general east-west pattern are found in regions 
where transport is less zonal, such as the southeastern North 
American monsoonal region, where there is strong poleward mois- 
ture transport onto land, and in the western United States, where the 
north-south alignment of the Rocky Mountains effectively blocks 
oceanic influence from the Pacific Ocean. 

The observed arc in the relationship between G and 2 is a ubiqui- 
tous feature of seasonally driven models that contain interacting land 
and ocean regions, and can be understood as the natural consequence 
of interacting sinusoids. Consider two sine waves with different 
phases and amplitudes, S; = Asin(#) and S) = (A/r)sin(@ + Ad). A 
weighted average of the two sine waves, wS, + (1 — w)S), with 
0<w<1, yields a sine wave with amplitude 


A 
Ay = [w+ 2rw(l w) cos Ad+(1—w)y* 
r 


and phase 


$y =64 cos! (2 (» “(1 w)cosé) ) 


A mixing line using this equation (Fig. 2a) is consistent with the general 
form of the observed arc. Apparently, the spatial structure associated 
with the seasonal cycle can be understood, to first order, as the result of 
variable mixing between continental and marine influence. 

Variability in G (Fig. 1d) tends to be largest where the climato- 
logical G is large (coefficient of correlation, R = 0.83), and is about 
twice as large over land (mean of point-wise standard deviations, 
@=5.2°C(kWm 7“) ') as it is over the ocean (@=2.5°C 
(kWm *)~'). Conversely, temporal variability in 1 (Fig. 1c) is cor- 
related with G-' (R= 0.62) and is larger over the ocean (@ = 5.0 
days) than it is over land (6 = 4.0 days). The inverse relationship 
and larger 7,cean variability arises because finite perturbations more 
readily alter the phase of a smaller amplitude sinusoid (see the 
Supplementary Information discussion on natural variability). We 
thus expect that it will be more difficult to detect the presence of any 
true phase trend over the ocean. 


Trends in the phase and gain of the annual cycle 


The 1954-2007 Alana trends (Fig. le) are predominantly towards 
earlier seasons, with a mean decrease of 1.7 days (that is, 6%) over 
the past 54 years. The A,cean trends are large but regionally disparate. 
For example, the interior of the North Pacific, and the Atlantic north 
of 50°N, exhibit trends towards later seasons, whereas along the 
eastern edge of the North Pacific, and in the North Atlantic south 
of 50°N, trends are primarily towards earlier seasons. The mean 
Aocean Shift is towards later seasons by 1.0 days over the past 54 years. 

A comparison of trend maps (Fig. le, f) and variability maps 
(Fig. 1c, d) reveals that the trends are large where the detrended 
variability is large. This suggests the obvious null hypothesis that 
the trends are merely a manifestation of natural variability. One test 
for whether the trends observed in the recent record are consistent 
with natural variability is to compare them with trends observed in 
earlier periods. We consider the distribution of point-wise trends 
(Fig. 2b) for the 1900-1953 and 1954-2007 intervals using those 
records which have good temporal coverage during both intervals 
(see Methods; this is the default distribution of records that we use 
below, unless specifically stated otherwise). Land and ocean are con- 
sidered separately because the characters of their annual cycles are so 
different. 
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Figure 2 | Mean annual cycle and distribution and character of trends. 
a, Observed relationship between local gain, G, and lag, 1, for Northern 
Hemisphere extratropical locations. Colour represents the distance between 
a grid point and the coast to its west (positive for land, negative for ocean). 
Outliers with 2 < 20 days are from the Indian subcontinent and presumably 
reflect monsoon dynamics. The black line shows the nonlinear relationship 
between amplitude and phase for weighted averages of two end-member sine 
waves (see Supplementary Information). b, Normalized histograms of 
point-wise / trends (in days per 54 years). Red lines represent land; blue lines 


We adopt a null hypothesis that the mean of each distribution of 
trends is zero. Testing this null hypothesis requires a knowledge of the 
effective spatial degrees of freedom***, and we use estimates 
obtained from the moment-matching method of ref. 25 (see 
Methods). Of the four distributions of 7 trends that we consider, 
only those over land during the 1954-2007 interval have a mean that 
differs significantly from zero (Table 1), and here the significance is 
marked (—1.9 + 1.0 days per 54 years, P< 0.001). Repeating the tests 
for 1954-2007, using the larger spatial network that is available for 
this interval (the dense network; see Methods), supports the signifi- 
cance of the Ajang trend (—1.7 + 0.8 days per 54 years, P< 0.001). We 
also detect a significant 1954-2007 A,cean trend towards later seasons 
(1.0 + 0.9 days per 54 years, P= 0.02) that is only detectable in the 
more extensive dense network. 

The dominant signal in the 1954-2007 G trend (Fig. 1f) is a decrease 
in the amplitude of the annual cycle over land, averaging 
—2.5°C(kWm ”) | over the past 54 years on the dense network, a 
3% drop. This is the well-known amplification of winter warming**”®, 


Table 1| Means of trend distributions 
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represent ocean. The dotted lines give the distributions for the control 
period (1900-1953) on the ‘comparison network’. The thick solid lines give 
the distributions for the same spatial network for 1954-2007. The thin solid 
line is for the dense network (1954-2007; see Methods for network 
descriptions). ¢c, Same as b, but for G trends (in °C (kW m ”) a per 54 years). 
d, Anomalies in composite land annual-cycle shape for 27-year periods, 
relative to the 108-year composite. Southern Hemisphere grid boxes have 
been shifted by six months before averaging. 


which is strongest in the interior of Eurasia and in the boreal forests of 
western Canada. Note, however, that large regions exist where the 
amplitude has increased. In western Europe and the Middle East, 
the observed increase in G is associated with greater warming in 
summer than in winter. In the central North Pacific and the south- 
eastern United States, the increase in G results from winter cooling. In 
Quebec, the summer warming and winter cooling trends are of com- 
parable magnitudes, leaving little trend in mean temperature but a 
measurable increase in G. Ocean G trends are almost everywhere small 
and show a mean increase of 0.4°C(kWm *)~! over the past 54 
years. 

We apply tests to the G trends similar to those made on the / trends 
(Fig. 2c). Of the four distributions of G trends, only those over land 
during 1954-2007 have a mean that differs significantly from zero 
(—2.6+2.4°C(kWm 7)! per 54 years, P< 0.03; Table 1). If the 
dense network is used instead for this interval, the Gj,,q trends remain 
significant (P = 0.05) and the Gocean trends emerge as being marginally 
significant (P = 0.07). It is noteworthy that the well-reported changes 


Comparison network 


Dense network 


1900-1953 1954-2007 1954-2007 
land 0.31 + 0.95 (P=0.5) —1.88 + 0.95 (P< 0.001) —1.66 + 0.81 (P < 0.001) 
ocean 0.72 + 1.24 (P=0.2) 0.00 + 1.23 (P= 1) 1.02 + 0.87 (P = 0.02) 
land 0.23 + 2.21 (P=0.8) —2.62 + 2.40 (P = 0.03) —2.54 + 2.54 (P = 0.05) 
Gocean 0.44 + 0.82 (P= 0.3) 0.27 + 0.69 (P= 0.4) 0.43 + 0.47 (P= 0.07) 


0.54 + 0.37 (P = 0.007) 
0.79 + 0.28 (P < 0.001) 
0.60 + 0.96 (P= 0.2) 
0.72 + 0.33 (P < 0.001) 
—0.06 + 0.83 (P= 0.9) 
0.07 + 0.32 (P= 0.6) 


Summer land temp. 
Summer ocean temp. 
Winter land temp. 

Winter ocean temp. 
Summer-winter land temp. 
Summer-winter ocean temp. 


0.86 + 0.38 (P < 0.001) 
0.60 + 0.26 (P< 0.001) 
1.66 + 1.00 (P = 0.005) 
0.46 + 0.29 (P = 0.005) 
—0.80 + 0.95 (P= 0.09) 
0.14 + 0.28 (P= 0.3) 


0.96 + 0.35 (P < 0.001) 
0.48 + 0.18 (P < 0.001) 
1.77 + 1.40 (P = 0.02) 
0.41 + 0.19 (P< 0.001) 
—0.81 + 0.94 (P= 0.08) 
0.07 + 0.21 (P=0.5) 


Phase lag (A) trends are in expressed in days per 54 years, amplitude gain (G) trends are expressed in °C (kW m 


= 


Two-tailed P values are given in parentheses. Significant values, judged using 95% confidence intervals, are set in bold. 
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in the amplitude of the annual cycle’”” are less significant than the 


less-reported land-phase trend. The low significance of the amplitude 
results is related to the large natural variability in wintertime temper- 
ature. Winter warming is considerably stronger than summer warm- 
ing over land during 1954-2007, but the variance in winter land 
temperatures is about four times that in summer land temperatures, 
making the winter trend less significant and making detection of 
changes in amplitude difficult**. In fact, we are unable to detect a 
significant difference between summer and winter warming when 
temperature trends are analysed as the difference between three- 
month seasonal averages (Table 1). 

We focus on the Ajang trends because their significance is markedly 
higher than that of any other observed trend. Furthermore, G trends are 
more easily discussed in the seasonal-average-temperature framework 
than are A trends and have received more attention elsewhere”*””. 

There are two steps in establishing the presence of an anomalous 
trend. The first is establishing that a trend is statistically distinguish- 
able from zero, which we demonstrated for the 1954-2007 Alana 
observations. The second is establishing that this trend is different 
in character from what we would expect in the naturally varying 
system, which is more difficult given the finite length of the instru- 
mental temperature records. We are particularly concerned about 
low-frequency variability being misinterpreted as an anomalous 
trend’. The absence of a significant Aj,nq trend for the 1900-1953 test 
period indicates that the trend during 1954-2007 is anomalous. By 
restricting ourselves to a smaller set of locations, we can also extend 
our analysis back to 1850. We construct an average Ajanq time series by 
averaging the phase time series from all of the land grid boxes with 
perfect temporal coverage between 1850 and 2007, and adopt the null 
hypothesis that the 1954-2007 trends result from natural low- 
frequency variability as represented in the 1850-1953 record. We 
build a distribution for this null hypothesis by calculating the trends 
of many synthetic time series having the same spectral amplitude 
structure as the 1850-1953 record, but with randomized phases” 
(see Supplementary Methods), and are able to reject it with very high 
confidence (P = 0.006). The phase trend over the past 54 years is not 
consistent with the structure of natural variability found in the earlier 
record. Furthermore, there is no 54-year period in the 1850-1953 
control period that would allow rejection of this null hypothesis. 
(Note that we are unable to meaningfully compare the 1954-2007 
trends in Agcean With the 1850-1953 period because instrumental 
coverage over the ocean during these early times was poor.) 

Finally, it is reasonable to ask whether the observed variability is 
really best thought of as a shift in the yearly sinusoidal component, or 
if it would be better described by changes in individual months. A 
change in a single month’s temperature will map into a shift in the 
annual cycle, although the yearly frequency component provides a 
poor description of such an anomaly. We calculate the mean annual 
cycles for four 27-year periods, using land grid boxes with good 
temporal coverage between 1900 and 2007 (see Supplementary 
Methods), and consider their anomalies from the 108-year mean 
annual cycle (Fig. 2d). (Consideration of the means in these four 
periods gives insight into the origins of the trends in the 1900- 
1953 and 1954-2007 intervals.) The most recent anomaly time series 
(1981-2007) exhibits the largest departures from the long-term 
mean, and 80% of its variance is explained by the yearly component. 
The most recent period has more variability at the annual period than 
the total variability during all preceding periods, highlighting both 
that these shifts are well described by a yearly sinusoidal component 
and the anomalous nature of the recent changes (see Supplementary 
Table 2). 


Origins of the changes in the annual cycle 

To explore the origins of the shifts in Giang and Alana, We first turn to 
the global climate model results summarized in the Fourth 
Assessment Report of the Intergovernmental Panel on Climate 
Change (IPCC)**. In particular, we analyse the 72 simulations of 
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twentieth-century climate that use observed forcings conducted for 
the World Climate Research Programme’s (WCRP) Coupled Model 
Intercomparison Project Phase 3 (ref. 31). The distributions of mean 
trends in Giang and Ajana found in the models easily encompass the 
observed trends during the 1900-1953 interval. However, during the 
1954-2000 interval (the simulations stop at the end of the century), 
the observed decrease in G),,4 has a larger magnitude than all but six 
of the model simulations, and no model reproduces the observed 
shift towards earlier seasons (Fig. 3). The mean of the model Ajang 
trends for 1954-2000 is towards later, not earlier, seasons. 
Furthermore, 25 atmospheric model runs forced with observed sea 
surface temperatures for the 1978-2000 period (the Atmospheric 
Model Intercomparison Project models**) do no better at replicating 
the observed Ajanq trends (see Supplementary Methods and 
Discussion). 

The IPCC model results do not appear to give us an explanation of 
the observed trends, except to suggest that the answer involves some- 
thing that the models do not capture. We thus retreat to a simple, 
conceptual model to explore how local processes may cause variability 
in A and G consistent with the observations. Our goal here is to explore 
some obvious candidates and roughly estimate the size of the per- 
turbation needed to explain the observation. We use a one-box energy 
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Figure 3 | Modelled and observed mean land trends. a, Observed 
1900-1953 land lag, Ajana and gain, Giang, trends and those from WCRP 
‘Climate of the Twentieth Century’ simulations, sampled at the same 
locations. Marks with same colour and shape indicate multiple runs with the 
same model. The large black cross indicates the actual observed trends 
(‘Instrumental Obs.’). b, Same as a, but for 1954-2000. The comparison ends 
at 2000 because IPCC runs generally stop then. Individual model 
descriptions are given in ref. 50 and references therein. 
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balance model based on ref. 33, forced with sinusoidally varying short- 
wave radiation, with atmospheric short-wave optical properties cal- 
culated following ref. 34 (see Methods). 

Many of the mechanisms invoked to explain variability in annual 
mean temperature are unlikely to be directly responsible for the 
observed shift in phase. Doubling the atmospheric long-wave optical 
depth to simulate the radiative effect of a very large increase in green- 
house gases has essentially no effect on seasonal timing (A/ = 0.1 
days, AG = —0.6°C (kW m ’)~'). Increasing solar luminosity by a 
fixed percentage increases the amplitude of the temperature response 
by the same percentage and has a negligible effect on phase’. 
Decreases in sea ice (not represented in our model) present the 
atmosphere with a larger thermal mass, implying a delayed seasonal 
response (although threshold responses at the time of spring melt 
may induce changes in the opposite direction). Consistent with this 
intuition, the (incorrect) phase delays found in the model results of 
ref. 2 are attributed to decreases in sea-ice cover. 

However, there exist numerous mechanisms that may shift the 
seasonal cycle in the observed direction. A decrease in thermal mass 
on land of 8 + 4% is sufficient to produce the observed offset in phase 
of 1.7 + 0.8 days. Thermal mass on land is largely modulated by soil 
moisture. Compare, for example, the effective thermal mass of a dry 
desert sand (1.9Jm *°C~') with that of a saturated loam soil 
(3.2Jm~°°C7'). For a typical soil, the observed phase shift would 
require a 13+7% decrease in soil moisture. IPCC Fourth 
Assessment Report model runs disagree with each other on the sign 
of recent soil moisture trends and show little skill at explaining the 
(sparse) observations*®. The few available long-term measurements 
suggest increased soil moisture over the latter part of the twentieth 
century’’”*, which is inconsistent with the thermal mass hypothesis, 
although we observe that drought reconstructions” indicate these 
observations may not be representative of continental-scale variations. 
The paucity of records with more than 40 years of data prohibits a 
more detailed comparison. We consider large-scale decreases in soil 
moisture to be a viable candidate for inducing the observed shift 
towards earlier seasons. 

Perturbations to atmospheric short-wave optical properties are also 
effective at modifying the annual cycle, and it appears that short-wave 
absorptivity has been changing, perhaps because of aerosols*”“’. The 
Earth’s short-wave optical properties are not constant throughout the 
year, and their potential range of variability is not captured by this 
simple model. Nonetheless, the model indicates that variability in 
atmospheric annual mean reflectivity, absorptivity or transmissivity 
on the order of 10% will change 21,4 by the observed amount. Note 
that Wallace and Osborn’ were unable to replicate the observed hemi- 
spheric phase shifts using a general circulation model, but that the 
inclusion of aerosol forcing did decrease the modelled (incorrect) shift 
towards later seasons. We see no indication of shifts in mean Ajanq after 
any of the major volcanic eruptions of the past century, although the 
effects of stratospheric and tropospheric aerosols on phase are likely to 
be quite different. 

Thomson' makes the intriguing, although difficult-to-evaluate, 
proposal that decreases in phase are due to an increased local sens- 
itivity to anomaalistic year forcing (associated with the annual cycle in 
Earth-Sun distance) relative to tropical year forcing (due to the 
annual cycle in the orientation of the Earth’s rotation axis relative 
to the Sun). 

The above-mentioned changes in albedo, soil moisture and short- 
wave forcing have all been implicated in changing modes of atmo- 
spheric circulation**’. This raises the further possibility that shifts in 
atmospheric circulation participate in the modulation of the annual 
cycle. We focus on the northern annular mode (NAM) and the Pacific 
North American pattern, as these have been shown to represent the bulk 
of the variability in standard atmospheric climate indices*® (but see 
Supplementary Information for a more complete analysis). The NAM 
shows significant cross-correlations with time series of 1950-2007 
spatially averaged Jiang (R= —0.5, P<0.001) and Giang (R= 0.42, 
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P= 0.007), whereas the Pacific North American pattern has significant 
correlation with Gocean (R= 0.3, P= 0.04). Apparently, atmospheric 
dynamical processes respond to similar forcing mechanisms as jana 
or themselves participate in altering 4),,q through the advection of heat 
and moisture or other indirect processes. Northern Hemisphere snow 
cover, for example, is known to interact with the NAM*”, and wind- 
driven changes in mixed layer depth affect the thermal mass that the 
ocean presents to the atmosphere**. 

The Atmospheric Model Intercomparison Project simulations 
have been found to capture the spatial pattern, but not the temporal 
pattern, of NAM variability”’, just as we find that the models fail to 
capture the long-term trends in phase. However, the recent phase 
excursion appears to be only partly explained by the late-twentieth- 
century excursion in the NAM. A regression of the spatial average 
Aiand time series against the NAM index from 1950-2007 removes 
25% of the variance and 40% of the trend in the 4),,4 time series, but 
still leaves a significant trend (P< 0.02) of — 1.0 days per 57 years (see 
Supplementary Information). 

The statistics of the distribution of Aj,,q trends are well described as 
natural variability from 1900-1953, but the distribution shifts in 
1954-2007, the period in which anthropogenic interference with 
mean temperature becomes apparent. If we extend our natural con- 
trol period back to 1850, the recent trends appear yet more anoma- 
lous. Numerous climate factors can influence the phase of the annual 
cycle, and it appears that some portion of the trend in the annual 
cycle is associated with changes in the NAM during the late twentieth 
century. We expect that a complete explanation for long-term shifts 
in atmospheric circulation will also encompass an explanation of the 
variability in the phase of the annual cycle. Although the mechanism 
is still uncertain, the tests we apply to the 1954-2007 trends in land 
phase indicate that they are inconsistent with natural variability, and 
thus appear to be due to anthropogenic influence. 


METHODS SUMMARY 


For each year of data, we calculate the yearly (one cycle per year) sinusoidal 
component using the Fourier transform, as 


2 11.5 


x 
12 t=0.5 


errit/ 2g 4 rm) 


where x(t + fo), f= 0.5, ..., 11.5, are 12 monthly values of either the de-meaned 
monthly temperature or de-meaned monthly insolation and tis time in months. 
The factor of two is to account for both positive and negative frequencies. Phase 
is given by ¢,. = tan '(Im(Y,)/Re(Y,)). To discuss both hemispheres in a com- 
mon framework, we reference the temperature phase, ¢y, to the local solar 
insolation phase, (un. The difference between these two phases is the lag’’, 
A= br _ suns 

Amplitude is given by A, = |Y,|. For the purpose of understanding the res- 
ponse of the Earth’s temperature to forcing, we examine the gain, which is 
defined as the ratio of the amplitudes of the annual cycles in temperature and 
insolation, G = Ay/Agyy. Unlike insolation or temperature amplitudes alone, G 
has very little latitude dependence. 

If any of the 12 monthly temperature values is missing in the data set at a given 
location, then no estimate of the annual cycle is made at that location for that 
year. Analyses using longer record pieces and more advanced filter techniques do 
not change our conclusions regarding the significance of phase and amplitude 
changes. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Data sets. When plotting full spatial fields (Figs 1, 2a), we use the HadCRUT3 
blended land-and-ocean 5° X 5° gridded surface temperature anomalies'® plus 
gridded climatology’’ from the Climate Research Unit at the University of East 
Anglia. For comparison with IPCC model archive output (Fig. 3), we use the 
HadCRUT3 data set (because models do not calculate separate land and ocean 
surface temperatures), but restrict ourselves to grid points that are more than 
50% land. When land and ocean are considered separately (all other calcula- 
tions), we instead use the CRUTEM3 (for land) and HadSST2 (for ocean) data 
sets so that the land and ocean signals are more cleanly separated. Dominantly 
land and ocean boxes are identified using the Clark US Navy Fleet Numerical 
Oceanographic Center Land/Ocean Mask’'. Because a grid box with land cover 
of only a few per cent is not as representative of land as a continental interior box, 
only CRU grid points that are more than 50% land in the Navy mask are used in 
land calculations. Those that are less than 50% land are considered ocean. 
Data mask. We desire a high ratio of signal (annual amplitude) to noise (errors in 
observations and high frequency temperature variability), so that we can isolate 
variability that is associated with changes in the annual cycle. In fact, the yearly 
sinusoidal component dominates the extratropical records; on average, it explains 
96% and 90% of the within-year variance in monthly temperatures for land and 
ocean grid boxes, respectively. With the time series of yearly G and / at each grid 
box, we estimate the long-term means, the long-term trends and the standard 
deviations of the departures from the long-term trend. To do so, we exclude from 
analysis (1) those extratropical grid boxes where less than 85% of the average 
within-year variance is explained by the yearly sinusoidal component (primarily 
the Southern Ocean) and (2) all tropical grid boxes (23.5° S to 23.5° N), because 
the two-cycles-per-year harmonic in forcing and response is strong in this region. 
For calculating long-term-mean / and G, we exclude grid boxes with fewer than 
ten yearly estimates over the entire record. For calculating 54-year trends and 
detrended standard deviation in G and A, we exclude grid boxes with fewer than 40 
yearly estimates. Trends are calculated using a least-squares fit. For comparing 
1900-1953 and 1954-2007 trends, we use a ‘comparison network’ of grid boxes 
that meet these data-density criteria for both periods (180 land points, 120 ocean 
grid points). We also use a ‘dense network’ of all of the grid boxes that meet the 
data inclusion criteria for 1954—2007, to obtain a best estimate for the most recent 
period (299 land points, 345 ocean points). The dense network has good spatial 
coverage between 25° N and 60° N (with some missing values in the interior of 
Eurasia and at higher latitudes) and more sporadic coverage between 25° S and 
40° S. For the comparison network, all of the Southern Ocean, most of the Pacific 
Ocean and much of Asia are excluded. 

Trend distribution testing. Tests for the deviation of distribution means from 
zero are done using the test (two-tailed) and confidence intervals are t-intervals. 
The standard deviation for the t-test is calculated from the observed distribution 
and the degrees of freedom are estimated as the effective spatial degrees of 
freedom (ESDOF) of the time-varying field using the moment-matching method 
of ref. 25, which they describe as appropriate when testing for the difference of a 
realization from the mean. This method estimates 21 (Ajang)» 19 (Agcean)» 12 (Giana) 
and 20 (Gocean) ESDOF for / and G variability. For the late dense network, the 
estimates are 29 (Ajanq), 58 (Agcean)s 12 (Giana) and 47 (Gocean) ESDOF. For summer 


nature 


temperature field variability we use 15 (land) and 9 (ocean) ESDOF for the com- 
parison network and 17 (land) and 30 (ocean) ESDOF for the dense network. For 
winter temperature field variability we use 8 (land) and 9 (ocean) ESDOF for the 
comparison network and 6 (land) and 31 (ocean) ESDOF for the dense network. 
For seasonal-difference (summer temperature minus winter temperature) hypo- 
thesis testing we use ESDOF values calculated from fields of annual mean tem- 
perature and we use values of 10 (land) and 9 (ocean) ESDOF for the comparison 
network and 12 (land) and 22 (ocean) ESDOF for the dense network. Recovered 
ESDOF estimates are comparable to the observation-based estimates of ref. 23 and 
are notably smaller than the model-based estimate of ref. 24. For testing the 
average summer and average winter trend distributions, summer is defined as 
June, July and August in the Northern Hemisphere and as December, January 
and February in the Southern Hemisphere. Winter is defined as December, 
January and February in the Northern Hemisphere and June, July and August 
in the Southern Hemisphere. 

Energy balance model. The one-box conceptual model is a one-atmospheric- 
layer energy balance model, with a black-body surface and a black-body atmo- 
sphere, forced with sinusoidally varying short-wave radiation (S = Scos(21t)) 
characteristic of the annual cycle in radiation at 40° N. We add two complications: 
(1) to consider sensitivity to atmospheric optical properties, we specify atmo- 
spheric short-wave absorptivity (A = 0.15), transmissivity (J = 0.6), and reflec- 
tivity (R = 0.25), and calculate the effects of multiple reflections following ref. 34; 
(2) to consider the effects of increasing long-wave optical depth (t), we calculate a 
different atmospheric upward-radiating temperature (Ty, = T, — [H(In(3t/ 
2) —1)) and downward-radiating temperature (Tgown = T, + 1H), which are 
related to the interior atmospheric temperature (T,) by the atmospheric height 
(H) and lapse rate (/), following ref. 33 (ch. 5). Surface temperature (T,) tendency 
is a function of the sum of energy fluxes divided by the thermal mass of the surface 
(c,). On land, the depth of soil that contributes to the thermal inertia is estimated 
as the square root of the soil diffusivity times the timescale in question, and for 
annual timescales we use a depth of 4.7 m in calculating c. 

The surface energy budget is then 


iG ors = =7) +0Ty, 


ar 1-%R a 


4 
—oT, 


and the atmospheric budget is 


AT a; 
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Ge ~ s(A+ ) +oT! oT ey OT own 
where «, is the surface albedo. The model is run for parameter values typical for 
land, and we then perturb these values to estimate sensitivity. 

Thermal mass changes are equated with soil moisture changes assuming a soil 
consisting of 10% inorganic matter, 45% organic matter, 5% unfilled airspace 
and with a soil water content of 40%. A 13% drop in soil moisture then implies 


that the soil water content drops to 35%. 


51. Cuming, M. J&. Hawkins, B. A. TERDAT: The FNOC System for Terrain Data 
Extraction and Processing. Tech. Rep. Mil Project M-254 (second edition); 
prepared for USN/FNOC (Meteorology International, 1981). 
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The nature of the globular- to fibrous-actin 
transition 


Toshiro Oda!”, Mitsusada Iwasa’, Tomoki Aihara!, Yuichiro Maéda*® & Akihiro Narita® 


Actin plays crucial parts in cell motility through a dynamic process driven by polymerization and depolymerization, that is, 
the globular (G) to fibrous (F) actin transition. Although our knowledge about the actin-based cellular functions and the 

molecules that regulate the G- to F-actin transition is growing, the structural aspects of the transition remain enigmatic. We 
created a model of F-actin using X-ray fibre diffraction intensities obtained from well oriented sols of rabbit skeletal muscle 

F-actin to 3.3 A in the radial direction and 5.6 A along the equator. Here we show that the G- to F-actin conformational transition 
is a simple relative rotation of the two major domains by about 20 degrees. As a result of the domain rotation, the actin molecule 
in the filament is flat. The flat form is essential for the formation of stable, helical F-actin. Our F-actin structure model provides 


the basis for understanding actin polymerization as well as its molecular interactions with actin-binding proteins. 


Actin was discovered in muscle tissue by Straub in 1942 (ref. 1). The 
fibrous (F) form of actin is a major component of the thin filament in 
all muscle tissues. Since then, actin has been found in many eukaryotic 
cells to be the most abundant cytoskeleton protein, and it performs a 
broad range of important cellular functions in cell motility as well as in 
locating and transporting protein complexes in the cell. Actin plays 
these parts through the dynamic assembly and disassembly of struc- 
tures such as lamellipodia and filopodia, in a process referred to as 
actin dynamics”*. Actin exists in a dynamic equilibrium between 
monomeric G-actin and polymerized F-actin*. Actin strongly binds 
one adenosine nucleotide, and the polymerization involved in the G- 
to F-actin transition activates the ATPase. The ATPase activity drives 
actin filament treadmilling’, in which polymerization at one end and 
depolymerization at the other occur at the same time. Treadmilling is 
regulated by actin regulatory proteins, which mediate the actin 
dynamics. 

In 1990 the first crystal structure of G-actin was solved*; the F-actin 
atomic model was also proposed in a back-to-back paper’. The model 
has been widely accepted as an approximate low-resolution struc- 
ture. However, the conformational changes that occur at the G- to 
F-actin transition and the accompanying activation of the ATPase 
have remained elusive. A high-resolution structure is required to 
understand the detailed interactions between the subunits in the 
filament and the interactions between F-actin and actin-binding 
proteins. Here we describe a new high-resolution model for the 
F-actin structure, based on X-ray fibre diffraction data to 3.3A in 
the radial direction and 5.6 A along the equator, in contrast to the 
previous atomic models, which were based on fibre diffraction 
patterns with low resolution to about 8 A’. Our model shows that 
the essential difference between G-actin and F-actin is the relative 
rotation of the two major domains by about 20°, which gives the 
F-actin subunit a flat conformation. A second difference is the con- 
formation of the DNase-I binding loop (D-loop) in subdomain 2, 
which adopts an open loop conformation. There are no other large- 
scale differences. The flat conformation is also observed in the 
polymer of the bacterial MreB actin homologue”’, and thus it seems 
to be acommon characteristic of polymers among actin homologues. 
The G- to F-actin structural transition can account for a number of 


previous results about the consequences of modifications and muta- 
tions of residues'*"* as well as the binding of small molecules’®”” on 
polymerization. This model also provides a basis for understanding 
the structural stability of F-actin. 


Modelling of F-actin 


Our model of the F-actin structure is based on X-ray fibre diffraction 
analyses. An X-ray fibre diffraction pattern is essentially a section 
through the cylindrically averaged diffraction pattern from a single 
fibrous molecule. By analysing the diffraction, the structure can be 
deduced in favourable cases. In general, this is done by constructing a 
model and then calculating the expected diffraction pattern; by com- 
paring the calculated and observed diffraction patterns, a better 
model is eventually arrived at. However, cylindrical averaging leads 
to a significant loss of information. High resolution is essential for an 
unambiguous answer. To achieve high resolution, we combined 
several technical advances: we controlled the filament length by add- 
ing gelsolin, prepared well-oriented sols of F-actin'* by the use of an 
18 tesla superconducting magnet (actin filaments are diamagnetic), 
and recorded X-ray fibre diffraction patterns to 3 A at the SPring-8 
beam lines. From the extracted fibre diffraction data to 3.3 A in the 
radial direction and to 5.6 A along the equator, we constructed the 
model of the F-actin structure by altering the crystal structure of 
G-actin by use of the normal modes of actin and a molecular 
dynamics simulation, while monitoring the R-factor-non-fit, an 
equivalent of the free R-factor (Supplementary Fig. 1). Figure 1 shows 
that the diffraction pattern calculated from our F-actin model (left 
half of the image), especially the peak positions, is consistent with the 
observed pattern (right). 


Subunit conformation 

Actin has a nucleotide-binding cleft enclosed by two major domains 
(Fig. 2c)°. In almost all actin crystal structures, the cleft is closed and 
the two major domains are in a propeller-like twist with each other. 
This is characteristic of the G-actin conformation (yellow in Fig. 2a, 
b; Supplementary Fig. 2). In our F-actin model, we found a previ- 
ously unknown flat conformation, in which the cleft remains closed 


1X-ray Structural Analysis Research Team, RIKEN SPring-8 Center, RIKEN Harima Institute, 1-1-1, Kouto, Sayo, Hyogo 679-5148, Japan. 7ERATO project ‘Actin-filament dynamics’, 
Japan Science and Technology Agency (JST), 1-1-1, Kouto, Sayo, Hyogo 679-5148, Japan. *Structural Biology Research Center and Division of Biological Science, Graduate School of 


Science, Nagoya University, Furo, Nagoya 464-8601, Japan. 
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Figure 1| Comparison between the observed (right half) and calculated 
(left half) diffraction patterns from our model. The area indicated here, 
within the outermost boundary of 1/3.3 A ! in the radial direction and of 
1/5.6A _' along the equator, was used for the F-actin modelling. The 
diffraction pattern was calculated by using a disorientation angle of 2.9°, a 
repeat distance of 9,135 A and the selection rule ] = —153n + 331m. The 
diffraction pattern is divided into three parts (1/60 A~! and 1/6.5A~}, 
1/6.5A_! and 1/5.5 A, and 1/5.5A~! and 1/3.3 A) by the boundaries 
indicated in black circles, and the scaling was independently performed for 
each region. The overall similarities between the observed and calculated 
intensities are satisfactory: R-factor-fit = 0.143, R-factor-non-fit = 0.207 
and R-factor-non-fit (overall) = 0.184. The R-factor-fit and the R-factor- 
non-fit are defined in Supplementary Fig. 1. 


and the two domains are untwisted (cyan in Fig. 2a, b). The two 
conformations are related by a 20° rotation of the two major domains 
around the axis passing along the front of subdomain | and the side 
of subdomain 3, which was identified using the program 
DYNDOM" (red line in Fig. 2a, b). The relative rotation of the 
two major domains is achieved by rotations of the dihedral angles 
of residues 141-142 and 336-337 (blue in Fig. 2a). Another charac- 
teristic of our F-actin subunit is an extended D-loop, to fit the surface 
of the upper subunit along the strand (Fig. 3b). A 20° relative rotation 
of the two major domains, together with an extension of the helical 
D-loop of the tetramethylrhodamine-conjugated actin (TMR-—actin) 
crystal structure”, would basically account for the peak positions in 
the low-resolution part of the diffraction pattern, which determine 
the overall shape of the subunit (R-factor fit of ~0.25). 


Contacts between subunits 


Figure 3 displays the contacts between the subunits within our 
F-actin model. Each residue with a Co atom located within 10 A from 
any other Cx of a contacting subunit is highlighted in red, blue or 
yellow. Each subunit is labelled with a number, n, 1+ 1 or n+ 2, 
from the barbed-end side of the filament. The intra-strand interface 
appears to be extensive (Fig. 3b). The projection 283-294 of subunit 
n+ 2 (red) is enclosed by residues 61-65, 200-208 and 241-247 of 
subunit n (blue), resembling a ball (red) in a socket (blue). 
Additionally, the D-loop 38-49 of subunit n (blue) is extended 
towards the hydrophobic groove of subunit n+ 2 (red) between 
subdomains 1 and 3: Val43 and Met44 of subunit n contact 
Leu 346 and Phe 375 of subunit n + 2. 

The inter-strand contacts are formed by two projections from 
subdomain 4 (Fig. 3c). One is the carboxy terminus of the o-helix 
191-199 of subunit 1 (blue), which contacts the amino terminus of 
the o-helix 110-115 of subunit n + 1 (yellow) in the opposite strand. 
The other is the hydrophobic plug 265-271 in subunit n + 1 (yellow), 
which contacts four regions in the opposite strand, including 
residues 201-203 and 39-42 in subunit n (blue) and residues 170— 
174 and 285-286 of subunit m + 2 (red). The contacts mediated by 
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Figure 2 | Transition from the G-actin conformation to the flat 
conformation in F-actin. a, Front view. The subunit in our F-actin model 
(cyan) and the TMR-actin crystal structure (yellow; PDB code 1J6Z)” are 
superimposed on subdomains 1 and 2. Subdomains 3 and 4 are rotated with 
respect to subdomains 1 and 2 about the rotational axis (red line) in the 
direction indicated by the red arrow. The rotation is associated with bends of 
the polypeptide chain at residues 141-142 and 336-337, as indicated in blue. 
b, Side view, viewed from the left-hand side of subdomains 3 and 4 in 

a. Subdomains 1 and 2 are shown as light cyan and light yellow Cx traces. 
Stereo views of these panels are shown in Supplementary Fig. 2. c, A subunit 
in our F-actin model, with ADP depicted as red CPK balls and Ca** asa 
yellow CPK ball. In a-c, the subdomains are labelled with red numbers. 

d, Details of the hinge region indicated in green in c. The subunit in our 
F-actin conformation (cyan) and the TMR-actin crystal structure (yellow) 
are superimposed on the four-f-strand core of subdomain 3 including 

Val 163 and Ile 175 (red). The Cx positions are indicated by balls. 


the hydrophobic plug are three-bodied. Residues 39—42, 201-203 and 
286 contribute to both the inter- and intra-strand contacts. 


Bacterial MreB 


Bacterial MreB isa structural and functional homologue of actin 
MreB usually forms thin, non-helical filaments consisting of two 
protofilaments or bundles of them, and it rarely forms the two- 
stranded helical polymers seen in F-actin. The MreB crystal contained 
a straight, single-stranded protofilament, in which the contacts 
between the subunits were deduced to be biologically relevant’’. 
MreB has a closed cleft and the two major domains are untwisted, 
and thus it resembles the flat conformation of our F-actin subunit 
(Supplementary Fig. 3a, b). The intra-protofilament contacts are also 
similar to those of our F-actin model: residues 266-275 are enclosed 
by three segments, 227-229, 235-238 and 51-59 (Supplementary Fig. 
3c). Because the polypeptide chain of actin is longer than that of MreB, 
there are several insertions within the actin sequence. The actin- 
specific insertions 197—204 and 264-272 contribute to the inter-strand 
contacts, which are required to generate the two-stranded helical 


polymers. 


13,21,22 


Comparison of our F-actin model with previous models 

One important difference between our model and the original 
Holmes model’ (and refinements based on this model, the Lorenz 
model’) is the filament diameter. The Holmes model was built to fit 
the published radius of gyration of 25 A, which is actually too big. 
This caused the two strands in the Holmes model to be too far apart, 
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Figure 3 | Intra- and inter-strand contacts within our F-actin model. a, Our 
model of the F-actin structure including 13 subunits. The two subunits 
marked by the oval are magnified in b, and the three subunits marked by the 
triangle are magnified in c. b, c, Residues contributing to the intra-strand 
contacts between subunits (b) and those facilitating the inter-strand contacts 
(c) are highlighted. These residues have a Co within 10 A from any other Co 


which necessitated rebuilding of the hydrophobic plug to achieve 
contact. Our model is substantially smaller, and has a radius of gyra- 
tion of 23.7 A. The smaller filament diameter is clearly substantiated 
by the three-dimensional map independently reconstructed from 
cryoelectron micrographs at the resolution of 13.8 A, which is con- 
sistent with our model (Fig. 3d). Holmes et al. arrived at a similar 
conclusion for their actin model, which was built to fit electron 
micrographs of decorated actin (the Holmes 2003 model)’. There 
seems to be little doubt that the F-actin structure has a smaller dia- 
meter than in the original Holmes model. Therefore, it is unlikely that 
the hydrophobic plug largely alters its conformation upon the G- to 
F-actin transition because of the narrow inter-strand gap. 
Nevertheless, in our model the position and the shape of the hydro- 
phobic plug are adjusted within the narrow inter-strand gap, where 
the plug has major roles as the inter-strand connector. This role is the 
same as that in the original Holmes model, which is consistent with 
the results of the actin mutagenesis experiments”’. 

Our new model differs from the original Holmes model, and is 
much closer to the Holmes 2003 model'®. However, the Holmes 
2003 model does not reach the consistency of our model with our 
independently reconstructed electron micrograph map (Supple- 
mentary Figs 4 and 5 and Supplementary Videos | and 2). The subunit 
in the Holmes 2003 model is also flat, but it has a slightly different 
orientation in the filament and defects in the nucleotide-binding site, 
probably due to the complex shifts and rotations of each of the four 
subdomains. This is in contrast to our simple rotation of each of the 
two major domains. An extra shift of the C terminus applied to the 
Holmes 2003 model worsens the fitting to our electron micrograph 
map™. 


Discussion 


The previous crystal structures of actin and actin-related proteins are 
classified into the closed and open conformations, in which the two 
major domains are twisted relative to each other, regardless of bound 
nucleotides*’*>*’. The closed and twisted conformation is the struc- 
ture of actin that cannot polymerize, and the open and twisted con- 
formation is found with bovine actin-related protein 3 (Arp3) in the 
inactive Arp2/3 complex*®**. The opening of the cleft affects the 
binding modes of nucleotides”, and thus the closed-to-open con- 
formational transition is possibly related to nucleotide release. In 
contrast, the flat conformation of our F-actin subunit constitutes a 
new class, together with MreB. The flattening seems to be associated 
with the polymerization and ATPase activity. 

The close association between the flattening and the polymeriza- 
tion is supported by the following experimental results. Latrunculin, 
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of a contacting subunit. Black and red numbers represent the residue and 
subdomain numbers, respectively, whereas n, n + 1 and n + 2 are subunit 
numbers. d, The three-dimensional map independently reconstructed from 
cryoelectron micrographs is superposed with our F-actin model (resolution 
13.8 A). The volume of F-actin is 100%. 


a toxin isolated from sea sponge, binds to the cleft between the two 
major domains and inhibits polymerization’. The inhibition of 
polymerization is accounted for by the blocking of the conforma- 
tional transition due to the toxin binding. In contrast, the nucleotide- 
free actin polymerizes more readily than the control’’. The removal 
of the nucleotide, which connects the two major domains, must 
facilitate the flattening, thereby promoting the polymerization. 

When an actin molecule is incorporated into a filament, the actin 
ATPase is activated*®. Gln 137 has a crucial role in the ATPase: the 
replacement of Gln 137 by alanine markedly reduced the ATPase 
activity, whereas it promoted the polymerization*’. The G-actin crys- 
tal structures indicated that Gln 137 anchors a water molecule that 
attacks the y-phosphate (y-P) of the bound ATP, and the extremely 
slow ATPase of G-actin seems to be a consequence of the geometry 
(the long distance and the shifted orientation) between the water 
molecule and the y-P**. Gln 137 is located in subdomain 1, whereas 
ATP tightly binds to subdomains 3 and 4 (Fig. 2c, d). Thus, the 
domain rotation that is associated with the G- to F-actin transition 
moves Gln 137 and the y-P closer to each other, thereby probably 
allowing the bound ATP to be hydrolysed (Supplementary Fig. 6). In 
MreB with bound adenylyl imidodiphosphate (AMPPNP), in fact, 
the y-P is close (about 4 A) to the side-chain of the glutamate residue 
(Glu 131), at the position equivalent to Gln 137 of actin. 

However, it should be noted that our F-actin model does not allow 
us to discuss a more detailed mechanism of the ATPase. This is 
because the positions of the side chains are not sufficiently accurate 
(see also Methods Summary) and our model does not include water 
molecules, which are essential for the ATPase. Therefore, we cannot 
discuss additional mechanisms that may contribute to the ATPase, 
such as structural changes in the loop 108-111, as observed in E. coli 
HSP70 homologue Dnak (ref. 33), and a shift of the y-P through the 
rearrangement of the two phosphate-binding loops. Moreover, 
although it seems probable that the flattening of the actin subunit 
occurs before the ATP hydrolysis, we cannot exclude the possibility 
that the flattening occurs subsequent to the phosphate release. 

The flattening of the actin molecule generates changes in the intra- 
molecular interactions between the two major domains, especially 
around the two loops that connect them: the ‘sensor loop’ 71-73 
(ref. 25) and residues 108-111 (Supplementary Fig. 7). First, 
Arg 206, Glu 72, Arg 183, Asp 187, methylated (Me)His 73, Asp 179 
and Arg 177 form a concatenation of salt bridges, which probably 
contribute to stabilizing the F-actin conformation. The highly charged 
MeHis 73 residue enters the concatenation as a result of the domain 
rotation, which is probably the cause for MeHis 73 (refs 34 and 35). 
Second, Pro 109 and Leu 110, in the loop 108-111 of subdomain 1, 
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detach from the spot around Val 163 and Ile 175 in the hydrophobic 
core of subdomain 3 (Fig. 2d). This allows Leu 110 to interact with the 
subunit in the opposite strand at Thr 194, and this subunit—subunit 
contact also probably stabilizes the flat conformation. 

How does the flattening of the actin molecule confer the F-actin 
formation? A comparison of our F-actin structure with the two- 
stranded, non-helical straight polymer observed in the crystal of 
the formin—actin complex’ provides the answer. In Fig. 4, the intra- 
and inter-strand contacts of the straight polymer are compared with 
those of our F-actin model by placing the light-cyan-coloured sub- 
units n at the same rotational position about the helix axes. The 
flattening moves subdomain 4 of the subunit towards the helix axis 
by about 4 A, whereas subdomains 1, 2 and 3 are located at about the 
same radii from the helix axis. As a result, the C terminus of 
the o-helix (182-196, green) in subdomain 4 of subunit n moves to 
the vicinity of the N terminus of the o-helix (113-126, yellow) in 
subdomain 1 of subunit n+ 1 (Figs 4a, b), facilitating the inter- 
strand connections. Furthermore, the flattening generates extensive 
intra-strand connections. In the straight polymer, the loop 286-289 
(red and yellow balls) in the subunit n + 2 is on residues 200-202 of 
subunit 1, and does not contact the loop 242-246 (Fig. 4a). In con- 
trast, in F-actin, the loop 286-289 (red and yellow balls) in subunit 
n+ 2 is tightly surrounded by three segments, the residues 200-208, 
the loop 242-246 and the residues 61-65 of subunit n (Figs 4b, c). 
Moreover, the shift of the loop 286-289 enables the residues 202-203 
to contact the hydrophobic plug on the opposite strand at Glu 270 of 
subunit n+ 1. This region is the node between the intra-strand and 
inter-strand contacts. Thus, the flat conformation allows more 
extensive contacts between the subunits than in the straight polymer. 
As expected, filament formation is impaired by substitutions or 
modifications of the residues involved in these contacts, including 
the double mutation P243K/A204E (ref. 14), the phosphorylation of 
Thr 201 to Thr 203 (ref. 37), the mutations 1287S and F200S (T.A. 
and T.O., unpublished data) and the mutations D286R and D288R 
(Supplementary Fig. 8). The double and triple mutations K61A/ 
R62A, E241A/R244A, D286A/D288A and R290A/K291A/E292A are 
also lethal in yeast’. 

The previous comparison was made between the non-helical poly- 
mer in the formin—actin complex and our F-actin model. Even if we 
use computer modelling to twist the non-helical polymer into a 
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Figure 4 | Comparison between the two-stranded straight polymer in the 
actin-formin crystal and our helical F-actin polymer. a, The two-stranded, 
non-helical straight polymer observed in the crystal of the actin—formin 
complex (PDB code 1Y64)**. The polymer consists of two protofilaments 
(the strand including subunit n + 1 and the strand including subunits n and 
n+ 2) that are straight and parallel to each other. Similar intra-strand 
contacts are also observed in other crystals**~°. b, Side view of our F-actin 
structure. In a and b, the thick rods are «-helices 113-126 (yellow) of subunit 
n+ 1 and 182-196 (green) of subunit 7. c, Details of the contacts around 
Asp 286-Ile 287-Asp 288-Ile 289 of subunit m + 2 (red and yellow balls), 
surrounded by three segments of subunit n (light cyan). Note that Asp 286 
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helical polymer, by azimuthally rotating the G-actin structure, the 
inter-strand contacts are not strengthened and would remain 
unstable. Actually, the intra-strand contacts in the computer model 
are closer to those of the single-stranded, twisted polymer observed in 
the crystal of actin split by protease from E. coli A2 strain (ECP- 
actin)** (Supplementary Fig. 9). The helical disposition alone 
generates one-half of the intra-strand contacts around residues 
286-289 of our F-actin structure, whereas the other half is completed 
by the additional flattening of the subunits. Thus, stable F-actin is 
formed by a combination of the flattening of the individual subunits 
and the helical disposition of the subunits. Nevertheless, the two 
factors may not necessarily be tightly coupled. According to molecu- 
lar dynamics simulations, actin favours the closed and twisted con- 
formation regardless of the bound nucleotide*’, and thus the flat 
conformation in isolation must be less stable. Therefore, the 
F-actin structure may be influenced by the destabilizing force 
imposed by the strain of the subunit conformation and the stabilizing 
effect of the contacts between subunits. We speculate that the balance 
between the two might induce the multiple local conformations of 
F-actin, which could account for the dynamics of the F-actin struc- 
ture, as observed for tubulin and FtsZ?!*0*!, 


METHODS SUMMARY 


Well-oriented sols of F-actin, which were formed by Ca-actin and length-con- 
trolled by gelsolin, were prepared according to the procedures described prev- 
iously'*. From the sols thus obtained, we recorded X-ray fibre diffraction 
patterns at BL41XU, BL40B2 and BL45XU-SAX at SPring-8 (Fig. 1 in 
Supplementary Methods). From the patterns, the layer-line intensities were 
extracted and used for modelling of the F-actin structure as ‘grouping intensities’ 
re-indexed by the selection rule = —6n + 13m. The modelling was started from 
a rigid body refinement of three crystal structures that were aligned in a straight 
polymer, as observed in the formin—actin complexes”. The resulting models 
were further refined using 12 elastic normal-mode motions of the actin mole- 
cule, the molecular dynamics refinement and the final minimization of the 
effective energy term with FX-plor*’. To estimate the restriction of the reflection 
term imposed on the structure thus obtained, we examined the effects of struc- 
tural modifications on the R-factor. The R-factor increased by 5% when a shift of 
two successive residues by 2 A was made along the helix axis or a shift of six 
successive residues by 3 A was made in the plane perpendicular to the helix axis 
from the plausible positions. Thus, the reflection term restricts the conformation 
of the peptide chain at the level of several residues, and the detailed structure is 
affected by the conformational energy. The structural analysis of F-actin by 


behind Ile 287-Asp 288-Ile 289 joins the network including Arg 39 of 
subdomain 2, Glu 205 and Thr 203 of subdomain 4, and Glu 270 of the 
opposite strand. Ile 287 contacts a hydrophobic patch composed of Phe 200, 
Ala 204, Ie 208 and Pro 243 of subdomain 4. Asp 288 joins the salt bridges 
including Asp 244, Lys 291 of subdomain 4 and Arg 62 of subdomain 2. 

Ile 289 contacts Ile 64 of subdomain 2 and Tyr 166 of subunit n + 2. The 
residue that interacts with the N terminus of «-helix 202-215 shifts from 
Asp 288 in the parallel polymer to Asp 286 in our F-actin. In a—c, the 
subdomain numbers are indicated in red and the residue numbers are black, 
and the subunit numbers are shown by n, n + 1 and n+ 2. 
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cryo-electron microscopy was performed under the same conditions and by the 
same procedures as those described previously**“*. The actin mutagenesis 
experiment was performed using a baculovirus-based expression system*'. The 
protein structure was drawn using Molscript and Raster3D**~*’. 
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Intersubunit coordination in a homomeric 


ring ATPase 


Jeffrey R. Moffitt!*, Yann R. Chemla'*+, K. Aathavan’, Shelley Grimes, Paul J. Jardine’, Dwight L. Anderson** 


& Carlos Bustamante)?” 


Homomeric ring ATPases perform many vital and varied tasks in the cell, ranging from chromosome segregation to protein 
degradation. Here we report the direct observation of the intersubunit coordination and step size of such a ring ATPase, the 
double-stranded-DNA packaging motor in the bacteriophage 29. Using high-resolution optical tweezers, we find that 

packaging occurs in increments of 10 base pairs (bp). Statistical analysis of the preceding dwell times reveals that multiple 
ATPs bind during each dwell, and application of high force reveals that these 10-bp increments are composed of four 2.5-bp 
steps. These results indicate that the hydrolysis cycles of the individual subunits are highly coordinated by means of a 

mechanism novel for ring ATPases. Furthermore, a step size that is a non-integer number of base pairs demands new models 


for motor-DNA interactions. 


Multimeric ring ATPases of the ASCE (additional strand, conserved 
E) superfamily represent a structurally homologous yet functionally 
diverse group of proteins involved in such varied tasks as ATP syn- 
thesis, protein unfolding and degradation, and DNA translocation'>. 
Despite their importance, the coordination mechanism between the 
hydrolysis cycles of the individual and often identical subunits that 
compose these ringed proteins is poorly understood. Recent crystal- 
lographic and bulk biochemical studies** suggest various models of 
coordination in which subunits act sequentially and in order®", 
simultaneously and in concert!’, or independently and at random"’. 
Unfortunately, direct observation of subunit dynamics has only been 
reported for a heteromeric system, the F1 ring of ATP synthase’, the 
heterodimers of which function in a sequential manner. 

The DNA packaging motor in the Bacillus subtilis bacteriophage 
(29 provides a model system to investigate the intersubunit coordi- 
nation in homomeric ring ATPases because it can be fully reconsti- 
tuted in vitro’, it has a relatively slow translocation rate'*’”, and it has 
been extensively characterized by bulk” and single-molecule'*'”*"~* 
methods. Packaging of the double-stranded (ds) DNA genome of (29 
into its proteinaceous precursor capsid (prohead) is driven by a 
powerful molecular machine'* which consists of three multimeric 
rings organized coaxially around the point of DNA entry”: a dode- 
cameric™ ring of gene product 10 (gp10) known as the head-tail 
connector; a pentameric**”® ring of RNA molecules known as the 
prohead-RNA (pRNA); and a pentameric™”® ring of the ATPase 
gpl6 (see Fig. 1a). Sequence homology” places gp16 in the FtsK/ 
HerA family of dsDNA translocases”*. This family is itself a member 
of the large ASCE superfamily, thus relating the packaging motor to 
the ubiquitous AAA+ and RecA-like proteins*”. 

Recent studies of the packaging motor have suggested a mech- 
anism in which the subunits operate sequentially’’, each binding 
ATP, hydrolysing it and translocating the DNA by 2 bp'*”, before 
the next subunit repeats this cycle. Although this scheme is consistent 
with the observed data'”***° and with sequential models proposed for 
other ring ATPases***"*, direct observation of the coordination of 


the mechanochemical cycles of the individual subunits in the pack- 
aging motor is still lacking. Here we report the first measurements of 
the individual packaging steps of the @29 motor, which reveal both its 
step size and the novel coordination between its subunits. Because of 
its relation to the ASCE superfamily’’, the mechanism for the pack- 
aging motor we propose here may have implications for the function 
of a diverse set of ring ATPases. 


DNA is packaged in 10-bp increments 


To probe the dynamics of the packaging motor of (29, single pro- 
head—motor—DNA complexes are tethered between two 860-nm-dia- 
meter polystyrene beads held in two optical traps as in Fig. la. 
Packaging is initiated in situ’*”’ or by restarting stalled complexes'*"” 
in an ATP packaging buffer and monitored in a semi-passive mode in 
which the tension applied to the motor is kept within a narrow range 
by periodically changing the distance between the two traps 
(Supplementary Fig. 1). Motor translocation is determined from 
the decrease in the contour length of the DNA tether and is followed 
with base-pair-scale resolution®'~?. 

In our first experiments we probe packaging at an average low 
external tension of ~8 pN, and at ATP concentrations ([ATP]) above 
and below the Michaelis constant (K,) of the motor, ~30 uM”. 
Figure 1b shows representative packaging traces collected under these 
conditions. Across the full range of [ATP], packaging of DNA occurs 
in a stepwise manner consisting of ‘dwells’, in which the DNA length 
remains constant, followed by ‘bursts’, in which DNA is translocated 
in ~10-bp increments. We determine the average length of DNA 
encapsidated in these packaging bursts for each [ATP] from the 
periodicity in the average pairwise distance distribution (PWD) as 
seen in Fig. lc. No statistically significant trend is observed in the size 
of these bursts as a function of [ATP] (see Fig. 1d); thus, the average 
of these values, 10.0 + 0.2 bp (s.e.m.), is the best estimate for the burst 
size. 

To elucidate the mechanism by which the motor translocates in 
10-bp increments, we analysed the time the motor spends in the dwell 
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Figure 1| Bacteriophage 29 packages DNA in bursts of 10 bp. a, A single 
packaging bacteriophage prohead—motor complex and its dsDNA substrate 
are tethered between two beads each held in an optical trap and held under 
tension, F. Motor dynamics are inferred from changes in the contour length 
of the unpackaged DNA, L, as a function of time. Inset: cryo-electron 
microscopy reconstruction of the full motor complex”® (courtesy of M. 
Morais), ATPase in red, pRNA in yellow, connector in cyan and capsid in 
grey with a top view cartoon of the ATPase ring alone (below, grey). 

b, Representative packaging traces collected under low external load, ~8 pN, 
and different [ATP]: 250 uM, 100 uM, 50 uM, 25 uM, 10 uM and 5 uM in 
purple, brown, green, blue, red and black, respectively, all boxcar-filtered 
and decimated to 50 Hz. Data at 1.25 kHz are plotted in light grey. Contour 
length is plotted in bp of dsDNA. c, Average pairwise distributions of 
packaging traces selected for low noise levels (50% of all packaging data; see 
Supplementary Figs 2 and 3). Colour scheme as in b. d, The average size of 
the packaging burst versus [ATP] determined from the periodicity in c. Error 
bars are the standard deviation in the slope of a linear fit to the peak 
positions. Data collected at 500 .M and 1 mM [ATP] are not shown in b and 
c for clarity. 


before each burst and the time it takes to complete each burst as a 
function of [ATP]. Figure 2a shows the distribution of dwell times 
before the packaging bursts. The mean dwell time, seen in Fig. 2b, 
shows a strong dependence on [ATP] that follows an inverse hyper- 
bolic expression, (t) =(Kj/2+[ATP]) /(kmax[ATP]), with a Ky. of 
23+7uM (s.d.) and a knax of 8.7+0.7s ' (s.d.). In contrast, 
Fig. 2b shows that the average duration of the packaging burst has 
little or no dependence on [ATP], suggesting that ATP binding 
occurs only in the dwells and not in the bursts. Taken together these 
observations produce a packaging velocity with a Michaelis-Menten 
[ATP] dependence consistent with previous measurements”. 

The specific shape of the dwell time distributions seen in Fig. 2a 
provides further information on the kinetic transitions within a sin- 
gle dwell. In particular, the more sharply peaked the distribution, the 
larger the number of rate-limiting kinetic transitions that compose 
the dwell**. We quantify the degree to which these distributions are 
peaked with the ratio of the squared mean of the dwell times to their 
variance, the inverse of the randomness parameter™ (Fig. 2c). It can 
be shown that this parameter, m,,;,, provides a strict lower bound*® on 
the number of rate-limiting transitions under each [ATP] that occurs 
during the dwell. 

At limiting [ATP], 5uM <K,,, we measure an mj, of 2.0 + 0.4 
(s.e.m), indicating that there are at least two rate-limiting transitions 
in each dwell. Because ATP binding must be rate limiting under these 
conditions, we conclude that no less than two ATP molecules bind to 
the motor before each 10-bp burst. (In contrast, if a single ATP were 
to bind during each dwell, one would expect the dwell time distri- 
bution to be a single exponential and 1,,;,, to be 1 (refs 34, 36).) At 
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Figure 2 | Dwells before 10-bp bursts contain multiple kinetic events. 

a, Probability distributions for the dwell times preceding a 10-bp burst under 
low external load, ~8 pN, and different [ATP]: colour scheme as in Fig. 1. 
Distributions were estimated using kernel density estimation with a 
Gaussian kernel and the optimum bandwidth” and are truncated at the 
lowest detectable dwell time. Supplementary Fig. 2 contains the number of 
observed bursts for each [ATP]. Distributions for 500 uM and 1 mM [ATP] 
are not shown for clarity. b, The mean dwell time before the 10-bp bursts 
(red circles) for all [ATP] with an inverse hyperbolic fit (black line) and the 
mean duration of all bursts (blue squares, average denoted by black line). 
c, The minimum number of rate-limiting kinetic events during the dwell 
before the 10-bp bursts, min, for all [ATP]. Error bars are the standard error. 


saturating [ATP], 1mM > K,,, we measured an myjn of 3.8 + 0.5 
(s.e.m.). Because binding is no longer rate limiting, this indicates 
that at least four non-binding transitions must also occur in each 
dwell. For intermediate [ATP], both binding and non-binding tran- 
sitions can be rate limiting; thus, we expect Min to peak to a value 
greater than either of the extreme values, exactly as is observed. Thus, 
Fig. 2c indicates that in total no less than six kinetic transitions must 
occur in the dwell before each 10-bp burst—at least two ATP binding 
events and at least four non-binding transitions. 


Packaging occurs in four 2.5-bp steps 


The findings that packaging occurs in 10-bp increments—five times 
larger than the 2-bp value proposed from bulk measurements”? — 
and that the preceding dwells contain multiple ATP binding transi- 
tions suggest that the 10-bp bursts may be composed of multiple 
smaller steps that in general may be too fast to resolve under the 
above conditions. This inference is supported by the observation that 
many bursts have durations larger than the measurement bandwidth 
(Figs 1b and 2b), indicative of intermediate kinetic transitions. 
Supplementary Fig. 4 shows that occasionally these intermediate 
transitions can be resolved, appearing as short micro-dwells that split 
the 10-bp burst into smaller steps. A correlation analysis** confirms 
that these smaller steps occur in groups that sum to 10 bp, ruling out 
the possibility that these events represent a variable burst size 
(Supplementary Discussion). 

Asa direct demonstration of the composition of the 10-bp bursts, we 
follow packaging against high external loads at near-saturating [ATP] 
(250 uM). Because translocation steps correspond to force-generating 
kinetic transitions, we expect that the duration of the micro-dwells 
preceding these steps will increase with increasing external force’’. 
Figure 3a shows that, under 40 pN of average load, smaller steps of 
~2.5bp and integer multiples thereof can be clearly and frequently 
observed. The PWD for this data, shown in Fig. 3b, reveals a periodicity 
of 2.4+0.1bp (s.d.) and the step size distribution, shown in 
Supplementary Fig. 6, has a peak at 2.48 + 0.03 bp (s.e.m.). The inset 
to Fig. 3b shows that the periodicity in the PWD is independent of 
force, indicating that the 2.5-bp step size is a constant feature of the 
motor and that the 10-bp bursts observed at low force are composed of 
four 2.5-bp steps. This conclusion is further supported by the promi- 
nent fourth peak observed in the PWD which is consistent with the 
corresponding 10-bp periodicity observed at low force. 
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Figure 3 | The 10-bp bursts are composed of four 2.5-bp steps. 
a, Representative packaging traces collected with external loads of ~40 pN 
and 250 uM [ATP]. Data in light grey are plotted at 1.25 kHz whereas data in 
colour are boxcar-filtered and decimated to 100 Hz. b, Average pairwise 
distribution of packaging traces selected for low noise levels (50% of all 
packaging data; see Supplementary Figs 2 and 3). Inset: force dependence of 
the observed spatial periodicity. The solid line is the mean for all forces, 
2.4 + 0.1 bp (s.e.m.). ¢, Dwell time histogram for the 2.5-bp steps observed 
under the packaging conditions seen in a plotted in blue circles with a bi- 
exponential fit in black (n = 2,662). 


The dwell time distribution associated with the 2.5-bp steps 
(Fig. 3c) is well described by a weighted sum of two exponential 
decays, with a fast rate of 22 + 2 s | (s.d.) and a slow rate of 841 

' (s.d.). The fast rate rationalizes the fraction of 2.5-bp steps that 
are missed in our analysis and the slow rate is consistent with one out 
of every four dwells coming from the same peaked dwell time distri- 
bution observed at low force (Supplementary Figs 5 and 6). Finally, 
our data do not support alternative interpretations of the 2.5-bp 
periodicity, such as distortions from B-form or alternating integer 
steps, as discussed in Supplementary Figs 7 and 8 and in the 
Supplementary Information, although some variability in the step 
size on the ~0.1-bp scale cannot be ruled out. 


Intersubunit coordination 

Taken together these results indicate that the mechanochemical 
cycles of the identical subunits of the packaging motor of 29 are 
highly coordinated, with the loading of ATP and the translocation of 
DNA segregated into two distinct phases that comprise the mechano- 
chemical cycle of the entire ring (Fig. 4a). During the dwell phase the 
DNA is held at constant length while multiple ATPs are loaded, 
giving this dwell its observed [ATP] dependence (Fig. 2). This process 
is followed by the burst phase in which DNA is packaged in four 
increments of 2.5 bp, totalling 10 bp of DNA translocated per cycle 
(Figs 1d and 3b). Thus, this phase has an average duration that is 
independent of [ATP] but dependent on force (Fig. 2b). 

The observation of four translocation steps per burst strongly 
suggests that four ATPs bind to the ring during each dwell, one for 
each of the subsequent steps in the burst phase. This inference is 
consistent with the measured value of mpin at limiting [ATP] as 
reversibility in binding or differences in binding rates will decrease 
the observed value of min from the actual number of binding 
events™*. It is also consistent with the 10-bp burst size, as a single 
ATP provides insufficient free energy to package 10 bp against the 
high forces tested previously'*'”*’. Moreover, the binding of four 
ATPs predicts a coupling constant between ATP consumption and 
packaging of 2.5 bp per ATP, in reasonable agreement with the ~2 bp 
per ATP value estimated from bulk measurements’*”’. The ~25% 
discrepancy may be explained by additional processes that consume 
ATP in bulk measurements, such as the repackaging of DNA that 
slips from the capsid'*'°. However, it is also possible that a regulatory 
fifth ATP is bound each cycle and hydrolysed futilely. 

Our data also restrict the possible mechanisms by which these ATPs 
bind to the ring. The requirement that multiple substrate molecules 
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Figure 4 | Intersubunit coordination in the ring ATPase of 929. 

a, Schematic diagram of the two-phase mechanochemical cycle of (929 
overlaid on a sample packaging trace. b, Detailed kinetics of ATP binding. 
Binding occurs in two steps: ATP docking (green, T) followed by tight 
binding (red, T*). ¢, Schematic diagram of the communication between 
subunits during ATP binding. Upon tight binding of an ATP, the binding 
pocket of the next subunit, formerly inactive (grey), is activated for docking 
(green). d, Schematic depiction of the full mechanochemical cycle of 29. 
During the burst phase, ADP may remain on the ring (blue) to be released in 
the dwell phase. One subunit must be distinct from the others (purple) to 
break the symmetry of the motor and generate only four steps per cycle. The 
identity of this subunit may change each cycle. P;, inorganic phosphate. 


bind per cycle typically results in a sigmoidal dependence on [ATP]**— 
the hallmark of binding cooperativity. Yet previous velocity measure- 
ments’? and the mean dwell times measured here (Fig. 2b) are well 
described by a simple, non-sigmoidal ATP dependence. Because a sig- 
moidal [ATP] dependence arises whenever two or more binding events 
are connected reversibly’*, these two observations can be reconciled if 
and only if the binding of each ATP is separated from the other binding 
events by a largely irreversible transition. In this case the mean dwell 
time will display a non-sigmoidal [ATP] dependence despite the coop- 
erative binding of ATP (Supplementary Discussion). This requirement 
is consistent with previous observations for @29 (ref. 19) and related 
ring ATPases” which indicate that binding occurs in at least two kinetic 
steps: (1) a reversible ‘docking’ transition in which the molecule comes 
in weak contact with the catalytic pocket; followed by (2) a largely 
irreversible ‘tight-binding transition” in which ATP makes a stronger 
contact to the binding site and is committed to the hydrolysis cycle 
(Fig. 4b). More intermediate kinetic states in ATP binding are also 
possible, but are not required to explain our observations. 

In addition, the non-sigmoidal [ATP] dependence of the mean 
dwell times also restricts the temporal order in which the subunits 
can dock ATP. Kinetic schemes in which multiple subunits are capable 
of reversibly docking ATP simultaneously will necessarily have a sig- 
moidal [ATP] dependence because such schemes have binding events 
that are reversibly connected** (Supplementary Discussion). Thus, it 
is not sufficient to require that each loose docking of ATP be followed 
by a tight-binding transition; it is also required that only one subunit 
at a time can be involved in ATP docking. The simplest model that 
produces this time-ordered docking is one in which the tight-binding 
transition of one subunit allosterically activates the binding pocket of 
another subunit, making it competent to dock ATP, a process 
depicted in Fig. 4c (Supplementary Discussion). Although our data 
cannot uniquely determine the actual sequence in which the subunits 
bind ATP, this required allosteric activation in combination with the 
known interfacial interactions of adjacent subunits in related ring 
ATPases” strongly favours a mechanism in which successive ATP 
binding occurs in a sequential and ordinal fashion around the ring 
as depicted in Fig. 4b—d. 
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Figure 4d summarizes the kinetic transitions that occur during a 
complete mechanochemical cycle of the packaging motor. During 
the binding phase, four ATPs bind to the ring in the two-step process 
depicted in Fig. 4b, c. Previous work has shown that the release of 
phosphate precedes or coincides with translocation’’. Thus, after the 
ring has bound four ATPs, the burst phase is triggered, the first 
phosphate is released, and the first 2.5-bp step is taken. The burst 
phase then proceeds with three additional 2.5-bp steps preceded by 
three force-dependent micro-dwells. The number of rate-limiting 
steps, Min at saturating ATP (Fig. 2c) indicates that multiple kinetic 
transitions in addition to ATP binding must occur during the dwell 
phase. These transitions may correspond to the hydrolysis of the 
bound ATPs or the release of multiple ADPs from the previous cycle 
or, perhaps, both. Moreover, these transitions may occur together 
either as trigger or reset processes (Fig. 4d) or interspersed between 
ATP binding events. It is also possible that these additional events 
correspond to the tight-binding transitions, although this is unlikely 
given that tight binding is believed to occur quite rapidly”. 

The two-phase model we propose here is also consistent with 
previous measurements of the packaging motor’*"’. For example, it 
has been shown that the binding of a single non-hydrolysable ATP 
analogue is sufficient to pause the entire motor’’—a result consistent 
with the high degree of intersubunit coordination observed here. 
Furthermore, a biphasic sedimentation profile observed in sucrose 
gradient experiments suggests the ability of the ring to load multiple 
nucleotides’’, consistent with our model. Finally, the two-phase 
model predicts the same dependence of the packaging velocity with 
force and [ATP] as observed previously'*””. 


Non-integer step-size models 


Our finding that packaging occurs in four 2.5-bp translocation steps 
raises two notable questions on the motor mechanism. First, how 
does a dsDNA translocase move in a non-integer number of base 
pairs? And, second, howis the pentameric symmetry~*~*° of the motor 
broken, generating only four steps per cycle? A step size that is a non- 
integer number of base pairs prohibits any mechanism in which every 
motor subunit within a closed ring makes specific and identical 
chemical contacts with one strand of the DNA. Under this constraint, 
we can speculate on several alternative mechanisms that would pro- 
duce a 2.5-bp step size and the implications these models have for a 
pentameric motor. 

A non-integer step size could be generated if each subunit is capable 
of binding two or more alternating chemical moieties, which may or 
may not be on the same strand. Alternatively, the motor may make no 
specific contacts, but rather drives translocation by means of steric 
interactions, in which case the step size would be set not by the chemi- 
cal periodicity of the DNA but by the size of the internal conforma- 
tional changes that generate the power stroke. One example of such a 
mechanism is depicted in Fig. 5a, b where each subunit makes non- 
specific contacts with the major groove of the DNA. In such a model, 
generation of four translocation steps requires that one of the five 
subunits is not equivalent to the other four, breaking the symmetry 
of the pentameric ring. Because the nucleotide-free state is disengaged 
from the DNA”, one subunit may be required to retain nucleotide at 
the end of the previous cycle, ensuring that a strong contact with the 
DNA is maintained while the remaining subunits load ATP during the 
subsequent dwell phase. 

Alternatively, a single specific chemical contact may be made with 
the DNA but not with every subunit. In this class of models, only a 
subset of the subunits interacts with the DNA and relative motion 
between these subunits is what drives translocation. Figure 5c, d 
depicts an example of such a mechanism in which only two subunits 
make specific contact with the DNA. Translocation is achieved via an 
‘inchworm-like’ movement of these two subunits driven by distor- 
tions in the ring. One appeal of this mechanism is that because a 
single specific contact is made with the DNA, it produces an integer 
burst size, yet because the DNA-binding subunits are retracted by 
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Figure 5 | Packaging models that produce a non-integer step size. 
a, Depiction of a translocation model in which all subunits eventually 
contact the DNA (cyan spheres). The contacting subunit is outlined in black 
(top view). b, In such a model the size of internal conformational changes set 
the step size (side view). c, Depiction of a translocation model in which only 
two subunits contact the DNA (black outline). d, In such a model, one 
subunit maintains contact with the DNA (the latch) while the loading of each 
ATP introduces relative subunit-subunit rotations which distort the ring. 
This distortion extends one subunit (the lever) along the DNA by ~10 bp. 
The DNA contact point is then transferred from the latch to the lever, and 
the release of hydrolysis products relaxes the ring, retracting the lever and 
the DNA. The DNA contact is then transferred back to the latch, the ring 
resets and the cycle begins again. Because there are four subunits, the ring is 


retracted in four steps, dividing a 10-bp step into four ~2.5-bp substeps. The 
subunit colour scheme is the same as in Fig. 4. 


conformational changes induced into the ring by the other subunits, 
this burst can be divided into non-integer steps. Moreover, this 
model also explains naturally the observation of four steps by a pen- 
tameric motor, as one subunit interface must bear the accumulated 
distortion of the other four subunits, perhaps inactivating one of the 
five binding pockets. The relative motion between adjacent subunits 
needed to accommodate such a mechanism has been observed in the 
crystal structures of other ring ATPases®**° but has not been impli- 
cated as part of the translocation mechanism*'. Future measurements 
will be aimed at testing the spectrum of models discussed here. 


Conclusions 


We have presented here the first high-resolution measurements of 
the stepping dynamics of the ring ATPase of the packaging motor of 
bacteriophage 29. Our results indicate a highly coordinated two- 
phase mechanism in which the binding of ATP and the translocation 
of DNA by multiple subunits are organized into two distinct and 
temporally segregated portions of the mechanochemical cycle of 
the ring. Our observation of a 2.5-bp step size challenges the long- 
held view that DNA translocation must occur in integer base-pair 
increments, making it necessary to devise new and more complex 
models for motor—DNA interactions. In addition, although the inter- 
subunit coordination we observe is reminiscent of aspects of both the 
concerted-action model of the large tumour antigen of SV40 (ref. 15) 
and the sequential models proposed for the translocases BPV El, 
T7 gp4, 12 P4, Escherichia coli Rho and FtsK®”**, our mechanism 
represents a novel type of coordination not previously proposed for 
ring ATPases. Provocatively, although a two-phase mechanism con- 
trasts with these other models, it seems to be consistent with many of 
the biochemical''* and structural®””'°” observations made on 
these related systems. One notable exception is the ClpX protease 
for which biochemical data clearly suggest a limited degree of subunit 
coordination’®. However, recent work on a related system, the 
archaeal MCM, suggests that coordinated systems can take alterna- 
tive pathways when overcoming functional barriers such as catalyti- 
cally inactive subunits*’. Ring ATPases of the ASCE superfamily 
support a large and remarkably diverse set of cellular functions by 
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drawing on a comparatively small set of common structural features. 
Direct measurements of the intersubunit dynamics in these systems, 
such as those presented here, promise to reveal if these diverse cellular 
functions arise from a similarly small set of common structural 
dynamics. 


METHODS SUMMARY 


Complexes of prohead, gp16 and biotinylated DNA were prepared and attached 
to 860-nm-diameter polystyrene beads (Spherotech) coated with antibodies to 
(29 or streptavidin using methods that have been described previously'*”’. 
Tethers were assembled and packaging was restarted in a packaging buffer 
(50mM Tris-HCl, 50mM NaCl, 5mM MgCl, 10 ug ml! BSA, 0.1% NaN3, 
pH7.8) supplemented with various amounts of ATP (Sigma-Aldrich)'*””. 
Experiments were conducted in two separate dual-trap instruments, built around 
two different trapping lasers**”’, and calibrated using standard techniques****. 
The contour length of the DNA tether was calculated from the measured force 
and extension using the extensible worm-like-chain model as described 
previously'*'°. Pairwise distributions were calculated from data filtered with a 
sliding 20-ms boxcar window as described previously“*. The location and dura- 
tion of stepping transitions were found with a t-test analysis similar to previous 
methods”. Dwell times were calculated directly from the time between transi- 
tions, and burst durations were calculated from the number of points within 
transitions. The mean and variance were calculated directly from these dwell 
times, and the errors were estimated using a bootstrap method. min was calcu- 
lated directly from these moments**”**. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Sample preparation. Proheads, gpl16 and genomic DNA were isolated as 
described previously*’. For a stalled-complex method of initiation’*”’, a Clal 
(New England Biolabs) digest of genomic DNA was biotinylated using Klenow 
exo (New England Biolabs) to fill in the overhang with biotinylated nucleotides 
(Sigma-Aldrich)'*!”. Preferential packaging of the left end of the genome” 
favours the formation of prohead—motor—-DNA complexes with the 6,149-bp 
fragment of the Clal digest. Stalled complexes were then bound to antibody 
beads, made as described previously'*, and introduced into the tweezers with 
streptavidin-coated beads. For the in situ method of initiation’, a 4,277-bp 
tether PCR amplified from lambda DNA with a biotinylated primer was bound 
to streptavidin-coated beads, and stalled prohead—motor complexes*** were 
bound to antibody beads. In both initiation methods, tethers were formed in 
the tweezers by physically bumping the two beads. The in situ initiation method 
was used for all [ATP] = 25 11M as data were in general less noisy and easier to 
collect; however, a severe drop in tether formation efficiency below 25 uM 
required the use of the stalled complex method for low [ATP]. All tether lengths 
were selected to reduce the effect of packaged DNA on motor dynamics'*”?. 
Optical trapping instruments. Two different optical trapping instruments were 
used in these studies****. All low-force data for 25 uM = [ATP] = 250 tM were 
collected using an instrument built around a 845-nm, 200-mW diode laser*. All 
other data were collected using an instrument built around a high-power, diode- 
pumped, solid-state 1,064-nm laser*’. Both instruments exploit the correlations 
in the motion of the two trapped beads with a differential detection technique” 
to achieve base-pair resolution on the second timescale*'**. Owing to increased 
laser absorption at 1,064nm‘*, the high-force data were collected in an 80% 
deuterium-oxide (DO) buffer to avoid heating effects caused by the high laser 
power needed to provide the large opposing forces. Supplementary Fig. 5 shows 
that although D,O alters the kinetics of packaging, it does not change the size of 
the packaging bursts. In addition, when working with the 1,064-nm trapping 
laser, an oxygen scavenging system was added (100g ml! glucose oxidase, 
20 pg ml’ catalase, 5mg ml‘ dextrose; Sigma-Aldrich) to prevent the forma- 
tion of the reactive species singlet oxygen. 

Calibration. Traps were calibrated using the thermal fluctuations of the trapped 
beads*’. The contour length was calculated from the measured extension and force 
with the extensible worm-like-chain model using a persistence length of 53 nm, a 
stretch modulus of 1,200 pN'“”, and an average B-form DNA rise of 3.4A bp! 
(ref. 50). Distance calibrations were corroborated with video microscopy’’, which 
was calibrated to 0.3% with two different distance standards (Nikon; Graticules) 
and confirmed to 1% by measuring the extension of DNA of different lengths, ~1, 
2, 3 and 5.6 kb. All packaging experiments were conducted in a semi-passive mode 


nature 


(Supplementary Fig. 1), in which the trap separation was kept constant as pack- 
aging proceeded and was changed discretely to keep the tension within a set range: 
~6-—10 pN for low-force experiments and ~33—46 pN for high-force experiments. 
All reported data have been corrected for small systematic errors, ~4% 
(Supplementary Figs | and 2), determined from the discrete changes in the trap 
separation as described in the Supplementary Discussion. 

Analysis. The one-sided autocorrelation of a positional histogram of each semi- 
passive mode segment was used to calculate the pairwise distributions’. 0.25-bp 
bins and 0.1-bp bins were used for the histograms for the low-force and high- 
force distributions, respectively. Data were selected for low noise and clarity of 
steps as described in the Supplementary Discussion. The pairwise distributions 
of the selected data were averaged together to produce the reported distributions. 
The average spatial periodicity was quantified from the position of the peaks. 
Subsets of the high force data were analysed to produce the step-size measure- 
ment as a function of force. 

Stepping transitions were identified using a t-test analysis and a probability 
threshold of observing a given t-value of 10° *. Dwell times were calculated from 
the time between transitions and step sizes were calculated from the difference in 
mean position between transitions. The reported dwell time distributions in 
Figs 2a and 3c were selected based on the size of the subsequent step: 8-12 bp 
and 1.5—4 bp, respectively. Burst durations were estimated from the exponential 
decay rate of the distribution of the number of contiguous points for which the 
t-value probability was below the 10“ threshold. Because of our limited time 
resolution, we could not observe the expected peak in this distribution; thus, we 
report the observed exponential decay, a value less biased by the time resolution 
than the mean. The effective bandwidth of the test algorithm was varied to 
maximize the number of observed steps while minimizing the systematic errors 
introduced into the moments of the distribution from a finite dead time. By 
assuming a Poisson distribution, it was determined that this dead time intro- 
duces negligible systematic errors in the calculated moments for all reported 
distributions. 
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Cold streams in early massive hot haloes as the main 


mode of galaxy formation 


A. Dekel’, Y. Birnboim’”, G. Engel’, J. Freundlich’, T. Goerdt', M. Mumcuoglu’, E. Neistein’*, C. Pichon”, 


R. Teyssier®’” & E. Zinger’ 


Massive galaxies in the young Universe, ten billion years ago, 
formed stars at surprising intensities'’”. Although this is commonly 
attributed to violent mergers, the properties of many of these 
galaxies are incompatible with such events, showing gas-rich, 
clumpy, extended rotating disks not dominated by spheroids'>. 
Cosmological simulations® and clustering theory®’ are used to 
explore how these galaxies acquired their gas. Here we report that 
they are ‘stream-fed galaxies’, formed from steady, narrow, cold gas 
streams that penetrate the shock-heated media of massive dark 
matter haloes*®. A comparison with the observed abundance of 
star-forming galaxies implies that most of the input gas must 
rapidly convert to stars. One-third of the stream mass is in gas 
clumps leading to mergers of mass ratio greater than 1:10, and 
the rest is in smoother flows. With a merger duty cycle of 0.1, 
three-quarters of the galaxies forming stars at a given rate are fed 
by smooth streams. The rarer, submillimetre galaxies that form 
stars even more intensely~’”’’ are largely merger-induced star- 
bursts. Unlike destructive mergers, the streams are likely to keep 
the rotating disk configuration intact, although turbulent and 
broken into giant star-forming clumps that merge into a central 
spheroid*"®". This stream-driven scenario for the formation of 
discs and spheroids is an alternative to the merger picture. 

It appears that the most effective star formers in the Universe were 
galaxies of stellar and gas masses of ~10''Mo at redshifts z= 2—3, 
when the Universe was ~3 Gyr old. (Mo, solar mass.) The common 
cases! show star-formation rates (SFRs) of 100M5—200Mo yr |. 
These include ultraviolet-selected galaxies termed BX/BM galaxies 
(ref. 14) and rest-frame optically selected galaxies termed sBzK gal- 
axies (ref. 15), to be referred to collectively as ‘star-forming galaxies’ 
(SFGs). Their SFRs are much higher than the 4Mo yr | in today’s 
Milky Way, although their masses and dynamical times are compar- 
able. The co-moving space density of SFGs is n~ 2 10°-*Mpc °, 
implying, within the standard cosmology (termed ACDM), that they 
reside in dark matter haloes of mass <3.5 X 10'’7Mo. The most 
extreme star formers are dusty submillimetre galaxies (SMG)'*”, 
with SFRs of up to ~1,000Moyr_' and n~2X10°Mpce ”°. 
Whereas most SMGs could be starbursts induced by major mergers, 
the kinematics of the SFGs indicate extended, clumpy, thick rotating 
disks that are incompatible with the expected compact or highly 
perturbed kinematics of ongoing mergers’**. The puzzle is how mas- 
sive galaxies form most of their stars so efficiently at early times 
through a process other than a major merger. A necessary condition 
is a steady, rapid gas supply into massive disks. 

It is first necessary to verify that the required rate of gas supply is 
compatible with the cosmological growth rate of dark matter haloes. 
The average growth rate of halo mass, M,, through mergers and smooth 


accretion, is derived’ on the basis of the extended Press—Schechter (EPS) 
theory of gravitational clustering (Supplementary Information, 
section 1) or from cosmological simulations'*'”. For ACDM, the cor- 
responding growth rate of the baryonic component is approximately 


M~6.6M15(1+z)? 0.165 Me yr! (1) 


where M,) = M,/10'*Mo and fos is the baryonic fraction in the 
haloes in units of the cosmological value, f, = 0.165. Thus, at 
z= 2.2, the baryonic growth rate of haloes of mass 2 X 10'*Mo is 
M~ 200Ma yr, sufficient to maintain the SFR in SFGs. However, 
the margin by which this is sufficient is not large, implying that (1) the 
incoming material must be mostly gaseous, (2) the cold gas must 
efficiently penetrate into the inner halo and (3) the SFR must closely 
follow the gas supply rate. 

The deep penetration is not a trivial matter, given that halo masses 
of M,>10'*M.~ are above the threshold for virial shock heat- 
ing®?!8!, Mghock S 10'*Mo. Such haloes are encompassed by a stable 
shock near their outer radius, Ry, inside which gravity and thermal 
energy are in virial equilibrium. Gas falling in through the shock is 
expected to heat up to the virial temperature and stall in quasi-static 
equilibrium before it cools and descends into the inner galaxy”. 
However, at z= 2, these hot haloes are penetrated by cold streams*”””. 
Because early haloes with M, > Mnock populate the massive tail of the 
distribution, they are fed by dark matter filaments from the cosmic 
web that are narrow in comparison with R, and denser than the mean 
within the halo®. The enhanced density of the gas along these filaments 
makes the flows along them unstoppable; in particular, they cool 
before they develop the pressure to support a shock, and thus avoid 
shock heating (Supplementary Information, section 2). 

To investigate the penetration of cold streams, we study the way 
gas feeds massive high-z galaxies in the cosmological MareNostrum 
simulation—an adaptive-mesh hydrodynamical simulation in a co- 
moving box of side length 71 Mpc and a resolution of 1.4 kpc at the 
galaxy centres (Supplementary Information, section 3). The gas maps 
in Figs 1 and 2 demonstrate how the shock-heated, high-entropy, 
low-flux medium that fills most of the halo is penetrated by three 
narrow, high-flux streams of low-entropy gas (Supplementary Figs 
3-6). The penetration is evaluated from the profiles of gas inflow rate, 
M(r), through shells of radius r (Fig. 3, Supplementary Fig. 7). The 
average profile reveals that the flow rate remains constant from well 
outside R, ~ 90 kpc to the disk inside r~ 15 kpc. 

To relate the feeding by streams to the observed abundance of 
galaxies as a function of SFR, we use the MareNostrum inflow-rate 
profiles to evaluate n(>M), the co-moving number density of 
galaxies with an inflow rate >M. We first extract the conditional 
probability distribution P(M|M,) by sampling the M(r) profiles 
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Figure 1| Entropy, velocity and inward flux of cold streams penetrating hot 
haloes. a, b, Maps referring to a thin slice through one of our fiducial 
galaxies with M, = 10'*Mj at z = 2.5. The arrows describe the velocity field, 
scaled such that the distance between the tails is 260 km s__'. The circle marks 
the halo virial radius, Ry. The entropy, log K = log(T/p~”), in units of the 
virial quantities, highlights (in red) the high-entropy medium filling the halo 
out to the virial shock outside Ry. It exhibits (in blue) three radial, low- 
entropy streams that penetrate the inner disk, seen edge-on. The radial flux 
per solid angle is tn = r’ pv, in solar masses per year per square radian, where 
p is the gas density and v, the radial velocity. It demonstrates that more than 
90% of the inflow is channelled through the streams (blue), at a rate that 


uniformly in r, using the fact that the velocity along the streams is 
roughly constant (Supplementary Information, sections 5 and 6). 
This is convolved with the halo mass function”’, n(M,), to give 


Figure 2 | Streams in three dimensions. The map shows radial flux for the 
galaxy of Fig. 1 ina box of side length 320 kpc. The colours refer to inflow rate 
per solid angle of point-like tracers at the centres of cubic-grid cells. The 
dotted circle marks the halo virial radius. The appearance of three fairly 
radial streams seems to be generic in massive haloes at high redshift, and is a 
feature of the cosmic web that deserves an explanation. Two of the streams 
show gas clumps of mass on the order of one-tenth that of the central galaxy, 
but most of the stream mass is smoother (Supplementary Fig. 6). The 

= 10'°m © clumps, which involve about one-third of the incoming mass, are 
also gas rich—in the current simulation only 30% of their baryons turn into 
stars before they merge with the central galaxy. 
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remains roughly the same at all radii. This rate is several times higher than 
the spherical average outside the virial sphere, tity, ~8Mo yr | rad *, 
according to equation (1). The opening angle of a typical stream at R, is 
20° —30°, so the streams cover a total angular area of ~0.4 rad’, namely a few 
per cent of the sphere. When viewed from a given direction, the column 
density of cold gas below 10° K is above 107° cm” for 25% of the area within 
the virial radius. Although the pictures show the inner disk, the disk width is 
not resolved, so associated phenomena such as shocks, star formation and 
feedback are treated in an approximate way only (see density maps and 
additional cases in Supplementary Figs 3-5). K,;,, virial entropy. 


ee) 
n(A)= |" POEM )n(M) aM 
0 
The desired cumulative abundance, n(>M), obtained by integration 
over the inflow rates from M to infinity, is shown at z = 2.2 in Fig. 4. 
Assuming that the SFR equals M, the curve referring to M lies safely 
above the observed values, marked by the symbols, indicating that the 
gas input rate is sufficient to explain the SFR. However, M and the 
SFR are allowed to differ only by a factor of ~2, confirming our 
suspicion that the SFR must closely follow the gas input rate. The 
simulated SFR indeed traces the accretion rate to within a factor of 
two, but, given that our disks are poorly resolved, we focus here on 
the accretion as the more robustly simulated quantity. Because at 
z~2.2 the star-forming galaxies constitute only a fraction of the 
observed ~10''Mo galaxies”*”, the requirement for a SFR almost 
as great as M, based on Fig. 4, becomes even stronger. 

By analysing the clumpiness of the gas streams, using the sharp 
peaks of inflow in the M(r) profiles, we address the role of mergers 
versus smooth flows. We evaluate each clump mass by integrating 
Metump = f (M(r) /v,(r)) dr across the peak, and estimate a mass ratio 
for the expected merger as 1 = Maump/fpMy, ignoring further mass 
loss in the clump on its way in and deviations of the galaxy baryon 
fraction from f,. We use ‘merger’ to describe any major or minor 
merger with 4 = 0.1, as distinct from ‘smooth’ flows, which include 
“‘mini-minor mergers with p< 0.1. We find that about one-third of 
the mass is flowing in as mergers and the rest as smoother flows. 
However, the central galaxy is fed by a clump with p= 0.1 less than 
10% of the time; that is, the duty cycle for mergers is 7 < 0.1. A 
similar estimate is obtained using EPS merger rates’ and starburst 
durations of ~50 Myr at z= 2.5 from simulations” (Supplementary 
Information, section 5). 

From the difference between the two curves of Fig. 4, we learn that 
only one-quarter of the galaxies with a given M are to be seen during a 
merger. The fact that the SFGs lie well above the merger curve even if 
the SFR is ~M indicates that in most of them the star formation is 
driven by smooth streams. Thus, ‘SFG’ could also stand for ‘stream- 
fed galaxy’. This may explain why these galaxies maintain an 
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Figure 3 | Accretion profiles, M(r). Shown is the gas inflow rate through 
spherical shells of radius r, from the disk vicinity to almost twice the halo 
virial radius, obtained by integrating r’ pv, over the whole shell. The thick 
black curve is the average over the simulated galaxies of the fiducial case, 
M, ~ 10'’Mz at z= 2.5. It shows deep penetration at a roughly constant 
rate of ~100M« yr_', consistent with the virial growth rate predicted by 
equation (1). Apparently, the inflow rate does decay as the gas travels 
through the halo, but this decay is roughly compensated for by the higher 
cosmological inflow rate when that gas entered the halo (equation (1)), 
leading to the apparent constancy of accretion rate with radius. The coloured 
curves refer to four representative galaxies, two showing clumps with 

Lt = 0.1 (dashed lines) and two with smoother flows involving only mini- 
minor clumps with ps < 0.1 (solid lines). Clumps with . 2 0.3 appear within 
2R, about once in every ten galaxies; that is, major mergers are infrequent 
(Supplementary Fig. 7). The M(r) profiles serve for extracting the 
conditional probability distribution P(M | M,), leading to the abundance 
n(>M) (Supplementary Fig. 8). 


extended, thick disk while doubling their mass over a halo crossing 
time’. On the other hand, if the SFR is ~M, we learn from Fig. 4 that 
about half of the bright SMGs and most of the fainter SMGs lie below 
the merger curve and are therefore consistent with being merger- 
induced starbursts’’. 

We find that stream-fed galaxies of mass ~10''Mo at z~ 2.5 were 
the most productive star formers in the universe. The constraints on 
the overall SFR density at these epochs imply that SFR has been 
suppressed in smaller galaxies, for example by photo-ionization 
and stellar feedback*””’*. The early presence of low-SFR galaxies**”° 
requires quenching of SFR also at the massive end, perhaps due to 
gravitational heating by destructive streams’. 

The streams are likely also to be responsible for compact spheroids, 
as an alternative to mergers and the associated heating by expanding 
shocks”. Using equation (1), we find that at z= 2 the streams can 
maintain both the high gas fraction and the turbulence necessary for 
the disk to break up into giant clumps by gravitational instability, 
with dispersion-to-rotation ratio ¢/V~ 0.25, as observed***. The 
clumps migrate inward and dissipatively merge into a spheroid'®"'. 
The stream carrying the largest coherent flux with an impact para- 
meter ofa few kiloparsecs determines the disk’s spin and orientation, 
and the stream clumps perturb it. The incoming clumps and the 
growing spheroid can eventually stabilize the disk and suppress star 
formation. We can thus associate the streams with the main mode of 
galaxy and star formation occurring in massive haloes at z~ 2—3; the 
streams that create the disks also make them fragment into giant 
clumps that serve both as the sites of efficient star formation and 
the progenitors of the central spheroid, which in turn helps the 
streams to quench star formation. 

Although wet mergers may grow secondary disks*’, they are not as 
frequent as the observed SFGs (Fig. 4), these disks are neither gas rich 
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Figure 4 | Abundance of galaxies as a function of gas inflow rate, n(>M). 
Shown is the co-moving number density, n, of galaxies with inflow rate 
higher than M at z = 2.2, as predicted from our analysis of the cosmological 
simulation. The upper curve refers to total inflow. It shows that galaxies with 
M>150Moyr ' are expected at a co-moving number density 
n~3xX1074 Mpc ° (similar to estimates in other simulations*”**). Fluxes 
as high as M>500Mo yr ' are anticipated at n~6 X10 °Mpc °. The 
lower curve is similar, but limited to gas input by 4 > 0.1 mergers. The 
symbols represent the vicinity of where the observed massive star-forming 
galaxies can be located once their observed SFRs are identified with M. The 
sBzK and BX/BM galaxies are marked SFG’*. The SMGs respectively brighter 
and fainter than 5 mJy are marked accordingly'*’*. We see that the overall 
gas inflow rate is sufficient for the observed SFR, but the small margin 
implies that the SFR must closely follow the rate of gas supply. Most of the 
massive star formers at a given SFR are expected to be observed while being 
fed by smooth flows rather than undergoing mergers. By studying the 
contribution of different halo masses to the abundance n(>M), we learn that 
the high-SFR SFGs and SMGs are associated with haloes of mass 
10’7M5—10'°M js (Supplementary Fig. 9). An integration of M over halo 
mass and time reveals that most of the stars in the universe were formed in 
stream-fed galaxies, within haloes of mass >2 X 10''Mo at 1.5<z<4. 


nor clumpy enough, and, unlike most SFGs, they are dominated by 
stellar spheroids. 

The cold streams should be detectable by absorption or emission. For 
external background sources, our simulation predicts that haloes with 
M,~10'*Mz at z~ 2.5 should contain gas at temperature <10°K 
with column densities >10°° cm~? covering ~25% of the area at radii 
between 20 and 100kpc, with coherent velocities of <200km sl 
Sources at the central galaxies should show absorption by the radial 
streams in ~5% of the galaxies, flowing in at = 200 kms +, with col- 
umn densities ~10°' cm * (Supplementary Figs 3-5). 
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High-Q surface-plasmon-polariton 
whispering-gallery microcavity 


Bumki Min’?+, Eric Ostby’, Volker Sorger’, Erick Ulin-Avila?, Lan Yang't, Xiang Zhang”” & Kerry Vahala’ 


Surface plasmon polaritons (SPPs) are electron density waves excited 
at the interfaces between metals and dielectric materials’. Owing to 
their highly localized electromagnetic fields, they may be used for 
the transport and manipulation of photons on subwavelength 
scales”. In particular, plasmonic resonant cavities represent an 
application that could exploit this field compression to create ultra- 
small-mode-volume devices. A key figure of merit in this regard is the 
ratio of cavity quality factor, Q (related to the dissipation rate of 
photons confined to the cavity), to cavity mode volume, V (refs 10, 
11). However, plasmonic cavity Q factors have so far been limited to 
values less than 100 both for visible and near-infrared wave- 
lengths'*’. Significantly, such values are far below the theore- 
tically achievable Q factors for plasmonic resonant structures. 
Here we demonstrate a high-Q SPP whispering-gallery microcavity 
that is made by coating the surface of a high-Q silica microresonator 
with a thin layer of a noble metal. Using this structure, Q factors of 
1,376 + 65 can be achieved in the near infrared for surface-plasmonic 
whispering-gallery modes at room temperature. This nearly ideal 
value, which is close to the theoretical metal-loss-limited Q factor, 
is attributed to the suppression and minimization of radiation and 
scattering losses that are made possible by the geometrical structure 
and the fabrication method. The SPP eigenmodes, as well as the 
dielectric eigenmodes, are confined within the whispering-gallery 
microcavity and accessed evanescently using a single strand of low- 
loss, tapered optical waveguide’”"*. This coupling scheme provides a 
convenient way of selectively exciting and probing confined SPP 
eigenmodes. Up to 49.7 per cent of input power is coupled by 
phase-matching control between the microcavity SPP and the 
tapered fibre eigenmodes. 

The subject of high-Q optical micro- and nanocavities has been 
intensively investigated over the last decade. The extremely low photon 
loss rate and small cavity mode volume of photonic-crystal or whis- 
pering-gallery devices offer surprisingly rich physics, spanning many 
areas of research including nonlinear optics, quantum optics, and 
device physics'®'’. Whereas optical micro- and nanocavities made of 
dielectric or semiconducting materials exhibit large Q factors as well as 
small diffraction-limited cavity mode volumes, their metallic 
counterparts (surface-plasmonic cavities'*'®) have been optimized 
primarily for subwavelength-scale miniaturization and have given 
results well below the theoretically predicted performance limit— 
especially in terms of cavity loss—set by ohmic loss in the metal. 
This is believed to result from other loss contributions such as surface 
scattering, radiation, finite cavity mirror reflectance or a significant 
degree of field penetration into the metal. However, these seemingly 
distinct dielectric and plasmonic waveguiding principles can be com- 
bined ina single cavity by using mature optical microcavity technology 
such as that provided by disk’’”® or toroidal microcavities*’. Here we 


propose to utilize a dielectric microcavity, engineered to minimize 
surface blemishes and thereby reduce scattering'®”°, as a template for 
the creation of a surface-plasmonic whispering-gallery microcavity 
with a cavity plasmon-polariton loss rate close to the theoretical limit. 

The proposed plasmonic microdisk cavity structure is sketched in 
Fig. 1a. The plasmonic cavity is composed ofa core silica (silicon dioxide) 
disk microcavity clad in a thin layer of silver (see Methods). Silica micro- 
disk resonators are ideal templates for the study of surface-plasmonic 
whispering-gallery modes primarily because they routinely have optical 
Q factors greater than 1,000,000 (considerably larger than the metal-loss- 
limited Q factor). Using the wedge structure shown in Fig. la, Qfactors as 
high as 6 X 10’ have been demonstrated, showing a remarkably low 
scattering loss value’’. A scanning electron micrograph ofa silver-coated 
SPP microdisk resonator is shown in Fig. 1b, and the corresponding 
expanded view of the edge of the disk resonator is shown in Fig. Ic. 

A full vectorial finite-element analysis was performed for the SPP 
microdisk resonators””’, taking into account the effects of silver” and 
silica’ material dispersion. The theoretical cavity mode dispersion dia- 
gram of an SPP microdisk resonator (Fig. 2a) shows the real part of the 
eigenfrequency, f, of the cavity modes as a function ofan azimuthal mode 
number, m. The vacuum light line is defined by f= mc/2nR, with 
respect to the bottom radius, R,, of the template silica disk microcavity, 
and the silica light line is similarly defined by f= mc/21Ngjjica(f) Ry. Here c 
is the speed of light and 1,;);-q is the refractive index of silica. The eigen- 
modes of an SPP microcavity can be classified into two distinctive 
categories in terms of the cavity mode dispersion: (1) surface-plasmonic 
modes at the metal—dielectric interface and (2) optical dielectric modes 
due to the presence of a dielectric waveguiding channel. 

In the insets of Fig. 2a, the fundamental (first-order) SPP eigen- 
mode, the second-order SPP eigenmode and the fundamental dielec- 
tric eigenmode are plotted for magnetic energy density my = (1/ 
2 Lo) |B, z)|? (where {lg is the permeability of free space) using a 
false-colour map (a conventional cylindrical coordinate system 
(1,,z) is used for the analysis). The SPP eigenmodes of an SPP 
microdisk resonator have electromagnetic energy-density profiles 
that peak at the silica—metal interface in the transverse plane (con- 
stant ¢). The SPP eigenmodes are categorized as SPP ,,,, where qis the 
plasmonic mode number (H(1,¢,z) = Hop (, ze"), and the 
optical dielectric eigenmodes are denoted by DE;,,,, where h is the 
dielectric mode number (H(y, @, z) = Hi (x ze"). The plasmonic 
mode number is defined as the number of antinodes in |HZ;)| along 
the silica—metal interface (excluding the vicinity of the sharp corner 
of the microcavity). Dispersion relations for the four lowest-order 
SPP eigenmodes (q = 1, 2, 3, 4) and the two lowest-order dielectric 
eigenmodes (h = 1,2; see Methods) are plotted in Fig. 2a. 

The cavity mode index, n,, ofa specific eigenmode can be evaluated 
with respect to the dielectric cavity edge (r= Ry) as ne = mc/2mRof- 
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Silver coating (t) 


Figure 1| Tapered fibre waveguide and SPP whispering-gallery microdisk 
resonator. a, SPP microdisk resonator with a tapered optical fibre passing 
under its edge. The wedge-shaped disk edge is a by-product of isotropic 
buffered hydrofluoric acid etching of silica. A transverse cross-section of the 
cavity is shown for clarity. R,, bottom radius; R,, top radius; d, thickness of the 
silica disk resonator; tf, thickness of the metal layer. The straight fibre 
waveguide axis is denoted by the coordinate p and the gap width, d,, is defined 
as the horizontal distance from the dielectric cavity edge to the fibre axis. 

b, Scanning electron micrograph of a fabricated silver-coated SPP microdisk 
resonator (Ry = 10.96 pum, R, = 7.89 Lum, d = 2 tum, tf ~ 100 nm). ¢, Expanded 
view of the edge of the SPP microdisk resonator. 


Figure 2b shows the calculated mode index for modes SPP}, SPPain 
and DE,,,,. The mode index ofa fundamental surface-plasmonic mode 
(SPP},,,) is clearly larger than that of a fundamental dielectric mode 
(DE, ,) within most of the visible and near-infrared frequency band, 
owing to the plasmonic surface-wave characteristics. The mode index 
is important because it determines the phase-matching condition 
for excitation of SPP modes by an input tapered fibre waveguide. 
After n, has been calculated, the corresponding phase-matched 
fibre-waveguide mode index can be approximated as 


2] 
ji es) n(1 p54 7 ey (0) (1) 
./6(2—0) a 15 


where 6 = —d,/R, = 0 denotes the relative gap width (d,, gap width; 
see Methods). To qualitatively describe the effect of gap width vari- 
ation on the phase matching, the HE); mode index of a fibre wave- 
guide with a 1-,1m waist diameter is shown in Fig. 2b. The fibre mode 
index is slightly larger than the SPP ,,, mode index in the near-infrared 
wavelength band. However, owing to the above phase-matching for- 
mula, the SPP,,,, eigenmode can be effectively phase-matched to the 
tapered-fibre eigenmode by increasing the relative gap width. We also 
note that the diameter of the tapered fibre can be optimized to phase- 
match the cavity eigenmodes to the fibre eigenmode because the fibre 
diameter determines the mode index of the fibre eigenmode. 

The calculated cavity Q factors for SPP, eigenmodes as a function of 
azimuthal mode number, m, are plotted in Fig. 2c. The error bounds for 
the imaginary part of the permittivity of silver are taken into account in 
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Figure 2 | Cavity mode dispersion, effective mode index and Q factor. 

a, Cavity mode dispersion curves for an SPP microdisk resonator, calculated 
from finite-element eigenfrequency analysis. For this calculation, the 
thickness of the silver layer is 100 nm, and the bottom and top radii and the 
thickness of the template silica microdisk resonator were set to 11, 7.9 and 
2 um, respectively. Light lines, corresponding to vacuum and silica, are given 
as two black lines (silica material dispersion has been taken into account). 
For clarity, only the four lowest-order SPP eigenmodes and the two lowest- 
order dielectric eigenmodes are plotted. The first- and second-order SPP 
eigenmodes (SPP},,,, SPP>,,) and the fundamental dielectric eigenmode 
(DE},,,) are shown in the inset. b, Effective cavity mode indices, n,, of SPP} 
SPP>,, and DE,,,, (with respect to R,), shown as a function of resonance 
wavelength. The mode index of a tapered-fibre HE); mode is shown to 
demonstrate phase matching. ¢, The theoretical Q factor for SPP},,,, plotted 
as a function of azimuthal mode number, m 


estimating the bounds on the theoretical Q factors and are discussed in 
Supplementary Information”. The calculated Q factors consist of con- 
tributions from intrinsic metal loss (silica material loss is negligible in 
comparison with metal loss***”°) and the geometry- and material- 
dependent radiation loss into free space: Q.'~Qz>1,+ Qj]. 
Therefore, this Q value provides the ideal theoretical limit on the Q 
performance of SPP microdisk resonators that have negligible scattering 
loss induced by surface roughness. The radiation-limited Q factor, Qraas 
is orders of magnitude larger than the metal-loss-limited Q factor, Qmetal} 
the ideal SPP microcavity is thus metal-loss limited (see Methods): 
Q '~ Q> J: In Fig. 2c, the highest fundamental SPP Q factor is found 
to be 1,800, at the resonant wavelength of 1,062.45 nm (m= 85). Ata 
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wavelength of 1,568.25 nm (m = 54), which is close to the value used in 
measurements described below (series 1 in Fig. 3), the theoretical Q factor 
is 1,140 (see Supplementary Information for the lower and upper 
bounds, Q, = 700 and Q, = 2,210). The cavity mode volume, V, and 
the figure of merit Q/V of the SPP microcavity are estimated in 
Supplementary Information. 

To measure the SPP microdisk resonances experimentally, a narrow- 
linewidth (<300 kHz) tunable external-cavity semiconductor laser is 
coupled to the tapered fibre waveguide and scanned over the 1,520- 
1,570-nm wavelength range. The position of the tapered fibre with 
respect to the SPP microdisk resonator is controlled at a fixed vertical 
distance by piezoelectric stages with an encoded resolution of 100 nm, 
and the laser polarization is controlled using a fibre polarization con- 
troller and monitored with a polarimeter. For large overlap between the 
cavity and the waveguide modes, the tapered fibre is positioned under- 
neath the bevelled edge of the resonator, where the silica microdisk is 
free of silver coating. The output transmission is recorded using a 
photodetector and a digital oscilloscope. Figure 3a shows the normal- 
ized transmission spectrum from an SPP microdisk resonator with a 
Lorentzian line-shape fit (Fig. 3a, red curve) to each resonance. Two 
resonances, located at 1,523.59 and 1,532.76 nm (SPP; 3 and DE 74, as 
estimated by calculation), can be clearly identified. An expanded view 
of the scan (main panel modes outlined) is shown in the inset of Fig. 3a 
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Figure 3 | Q-factor measurements for silver-coated and chromium-coated 
microdisk resonators. a, Normalized transmission spectrum showing the 
highest measured SPP Q factor of 1,376 + 65 and a dielectric resonance with 
a Q factor of 4,025 + 262. The inset (main panel outlined) is the entire 
wavelength band (1,520-1,570 nm) scanned for this sample. 

Ry = 15.45 + 0.05 um, R, = 12.73 + 0.04 um, d = 2 um, t~ 100 nm. 

b, Statistical histogram of measured Q values showing the occurrence of each 
eigenmode (SPP and dielectric) for two different sample batches (series 1 
and series 2). Mean (Q) and standard deviation (c) of Q factors are shown in 
the key (series 1, n = 3 measurements; series 2, n = 9). c, Normalized 
transmission spectrum for a chromium-coated microdisk resonator 

(Rp ~ 11 pm, Ry ~ 7.9 um, d = 2 um) with a two-dip Lorentzian fit. 
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and spans three free spectral ranges of SPP and dielectric eigenmodes. 
The cavity Q factor for the fundamental SPP; ,s3 eigenmode is found to 
be 1,376 + 65 (which falls within the theoretical Q-factor range of 
760 S Q < 2,360, with a nominal Q factor of 1,225 for the SPPj.s3 
eigenmode), and that of the fundamental DE,7, mode is 
4,025 + 262. This SPP Q factor of ~1,376 is over 30 times larger than 
the Q factors reported in previous SPP cavity work’*"*. 

To determine the reproducibility of this Q factor, two series of samples 
of different nominal sizes (series 1, R,=10.93,1m; series 2, 
R, = 15.56 um) were tested. The measured Q factors for both the SPP 
and dielectric eigenmodes in the 1,550-nm wavelength band are plotted 
statistically in Fig. 3b. Two separate clusters of Q factors are seen in this 
plot, indicating the distinctive resonant characteristics of the two sorts of 
eigenmode and a tendency for loss to decrease (Q factor to increase) as 
the size of the cavity increases. The measured Q factors are bounded 
within the range predicted for the plasmonic eigenmode (see 
Supplementary Information). To test the metal-dependent resonance 
characteristics of the SPP microdisk, chromium (which is highly lossy at 
optical frequencies) was deposited onto the silica microdisk using the 
same sputtering process, for use in control experiments. The normalized 
transmission spectrum for a chromium-coated microdisk resonator is 
shown in Fig. 3c. In this case, only low-Q resonances (for example 
Q~ 213 at 1,561.92 nm) are observed, owing to the presence of the 
chromium layer. These resonances are primarily of optical dielectric 
origin, as confirmed by finite-element simulations, because the fun- 
damental SPP eigenmodes of a chromium-coated microdisk of this size 
should have a theoretical Q factor of ~10 in the 1,550-nm band. 

To verify the phase-matched excitation of the cavity eigenmodes, a 
series of measurements were performed with variations in the position 
of the tapered fibre waveguide relative to the SPP cavity. Figure 4 shows 
the normalized transmission spectra (for an SPP microdisk from a 
batch from series 2) excited at different gap widths, d,, and also the 
corresponding optical micrographs and relative positions between the 
cavity and the tapered fibre waveguide. Each of the eigenmodes is 
assigned a mode number (Fig. 4a) inferred from finite-element simula- 
tions (Methods). The importance of the phase matching between cavity 
and fibre eigenmodes is manifest in the observed transmission spectra. 
At larger gap widths (d, ~ 0.8, 0.4 um), only the resonances of the first- 
and second-order SPP eigenmodes (SPP},,, and SPP3,,,) are observable, 
and the fundamental dielectric eigenmode (DE;,,) resonances are 
absent. This is because, for this range of larger gap widths, SPP eigen- 
modes are better phase-matched to the fibre eigenmode” and have a 
larger field overlap with the fundamental fibre eigenmode (they are 
located closer to the edge of, and extend farther outside, the microcavity 
than does the fundamental dielectric eigenmode in the wedge-shaped 
structure). As the gap width decreases further (d,=0), the fun- 
damental dielectric eigenmodes are excited, as the phase-matching 
condition can be partly satisfied by decreasing d,. For negative gap 
width, the SPP resonances are even more pronounced, as the phase- 
matching condition between the SPP and fibre eigenmodes can be fully 
satisfied owing to gap-width-induced phase matching, as is shown 
qualitatively in Fig. 2b. For the SPP resonance at 1565.4 nm, an input 
power transfer of up to 49.7% is demonstrated, showing the effective- 
ness of phase-matching control using the tapered fibre waveguide. 

The demonstration of high-Q surface-plasmonic microcavities opens 
many possibilities for applications in fields ranging from fundamental 
science to device engineering. As a specific example, it could make 
possible a plasmonic laser, for which adequate gain materials as well as 
a high-Q SPP cavity are key prerequisites”. Although the demonstrated 
SPP Q factor is still less than that of an optical micro- or nanocavity'®”, 
the corresponding SPP loss coefficient of %spp ~ 271/AQspp = 39 cm! 
(where J is the wavelength) satisfies the experimental criteria for a laser 
cavity and shows that, in principle, such surface-plasmonic lasing devices 
are possible. The tapered-fibre excitation scheme also demonstrates a 
convenient means of exciting these structures and selectively probing 
SPP cavity modes, because it directly controls the mode overlap and 
phase matching between the cavity and fibre eigenmodes (we also note 
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Figure 4 | Transmission spectrum versus waveguide coupling gap. a, Series 
of normalized transmission spectra, recorded for a variety of gap widths 
between the tapered fibre waveguide and the edge of the SPP microdisk. 
Resonances of SPP and dielectric eigenmodes are shown with estimated 
mode numbers. Ry = 15.70 + 0.185 tum, R, = 13.08 + 0.14 um, d= 2 um, 
t~ 100 nm. For the SPP resonance at 1565.4 nm, an input power transfer of 
up to 49.7% is demonstrated (second panel from the top). b, Optical 
micrographs corresponding to the recorded normalized transmission 
spectra. Estimated gap width, d,, is also shown. 


that coupling to a conventional, chip-based waveguide is possible”*). 
Furthermore, it is notable that the SPP Q factor could be substantially 
increased beyond the values measured here by lowering the temperature 
of the SPP microcavity”””’. From a fundamental standpoint, the SPP Q 
factor is sufficient to observe interesting cavity quantum electrodynami- 
cal phenomena in the weak-coupling regime relating to enhanced 
Purcell factors'’”?*°. In addition, using the high nonlinearity of metal 
(or materials deposited in the vicinity of the metal), it may be possible to 
extend the applications of nonlinear plasmonics. Finally, it should be 
noted that, because the /°Q/V values of the present SPP microcavity 
(approximately a few hundred) are still much less than those provided by 
the photonic-crystal and dielectric whispering-gallery microcavities'®"’, 
it is still important to pursue new plasmonic cavity designs. 


METHODS SUMMARY 

Fabrication of SPP microdisk resonators. The template silica microdisks are 
fabricated by photolithography and buffered oxide etching as described else- 
where’””!. During the wet etch, the photoresist is undercut and produces a 
bevelled silica edge, which provides conformal silver coating of the top surface 
of the microdisk. The silver coating is deposited on the template silica microdisks 
using a d.c. sputtering technique with a chamber argon pressure of 30 mtorr. 
Two batches of samples (series 1 and 2) are prepared in this way to investigate the 
size-dependent characteristics of SPP microcavities. 

Phase-matching condition. An asymptotic phase-matching formula can be 
obtained by two different approaches. Using the coupled-mode theory, the 
evaluation of the coupling coefficient « involves the overlap integral of the cavity 
eigenmode and the tapered-fibre eigenmode. To have a non-zero coupling 
strength, the waveguide mode index can be approximated as written in equation 
(1) by setting the @ dependence of the integrand to zero. The same phase- 
matching condition can be found by path-averaging the effective mode index 
seen by the straight fibre waveguide”’. This gives exactly the same formula 
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confirming the asymptotic dependence of phase matching on the relative gap 
width, 6. This formula applies only to the case of negative gap width, that is, 
6=- d,/ Ry =0. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mode number of a dielectric eigenmode. The mode numbers, h = 1, 2,..., of 
the dielectric eigenmodes DE;,,, are assigned in order from lowest- to highest- 
order dielectric eigenmode. Depending on the geometry and the mode number 
h, dielectric eigenmodes can possess certain degrees of plasmonic characteristics 
due to the presence of the metal-silica interface. 

Theoretical Q-factor estimation. From the finite-element eigenfrequency ana- 
lysis, the complex-valued eigenfrequency, f= fre + ifim, can be calculated, and Q 
factors evaluated using the formula Q= f,./2fim. The ranges of theoretical Q 
factors are estimated by using the error bounds in the imaginary part of the 
permittivity of silver. The radiation-limited Q factor can be estimated and sepa- 
rated from the metal-loss-limited Q factor by removing the imaginary part of the 
permittivity of silver. For example, the radiation-limited Q factor for m= 54 
(Fig. 2c) is 3.9 X 10°, and for m = 85 the Q factor is 6.7 X 10°, both of which are 
orders of magnitude larger than the total Q factors. 

Eigenmode identification. To assign mode numbers to the experimentally 
obtained resonance spectra, such as those shown in Fig. 4a, the size of the cavity 
is measured with a scanning electron microscope and the measured geometrical 
dimension is used in the finite-element calculation. Owing to the high sensitivity 
of the resonance frequency with respect to the nanoscale geometrical variation 
and the permittivity of the component materials, only the approximate mode 
numbers can be inferred. There being distinct ranges of Q factors indirectly 
confirms the theoretical SPP and dielectric resonance locations. Then the trans- 
mission of each resonance is experimentally determined by varying the gap width 
and the input polarization to assign distinct resonant characteristics precisely to 
each of the eigenmodes. 
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Warming of the Antarctic ice-sheet surface since the 
1957 International Geophysical Year 


Eric J. Steig', David P. Schneider’, Scott D. Rutherford®, Michael E. Mann’, Josefino C. Comiso” & Drew T. Shindell® 


Assessments of Antarctic temperature change have emphasized the 
contrast between strong warming of the Antarctic Peninsula and 
slight cooling of the Antarctic continental interior in recent 
decades'. This pattern of temperature change has been attributed 
to the increased strength of the circumpolar westerlies, largely in 
response to changes in stratospheric ozone’. This picture, however, 
is substantially incomplete owing to the sparseness and short dura- 
tion of the observations. Here we show that significant warming 
extends well beyond the Antarctic Peninsula to cover most of West 
Antarctica, an area of warming much larger than previously 
reported. West Antarctic warming exceeds 0.1 °C per decade over 
the past 50 years, and is strongest in winter and spring. Although 
this is partly offset by autumn cooling in East Antarctica, the 
continent-wide average near-surface temperature trend is positive. 
Simulations using a general circulation model reproduce the essen- 
tial features of the spatial pattern and the long-term trend, and we 
suggest that neither can be attributed directly to increases in the 
strength of the westerlies. Instead, regional changes in atmospheric 
circulation and associated changes in sea surface temperature and 
sea ice are required to explain the enhanced warming in West 
Antarctica. 

Recent changes in Antarctic ice-sheet surface temperatures appear 
enigmatic when compared with global average temperature trends. 
Although the Antarctic Peninsula is one of the most rapidly warming 
locations on Earth, weather stations on the Antarctic continent generally 
show insignificant trends in recent decades'. However, all but two of the 
continuous records from weather stations are near the coast, providing 
little direct information on conditions in the continental interior. The 
widely used weather forecast reanalysis data are known to have errors 
owing to inconsistent assimilation skill in the satellite and pre-satellite 
eras’. 

In this Letter, we use statistical climate-field-reconstruction tech- 
niques to obtain a 50-year-long, spatially complete estimate of 
monthly Antarctic temperature anomalies. In essence, we use the 
spatial covariance structure of the surface temperature field to guide 
interpolation of the sparse but reliable 50-year-long records of 2-m 
temperature from occupied weather stations. Although it has been 
suggested that such interpolation is unreliable owing to the distances 
involved’, large spatial scales are not inherently problematic if there is 
high spatial coherence, as is the case in continental Antarctica’. 

Previous reconstructions of Antarctic near-surface temperatures 
have yielded inconsistent results, particularly over West Antarctica, 
where records are few and discontinuous*’. We improve upon this 
earlier work in several ways. We use two independent estimates of the 
spatial covariance of temperature across the Antarctic ice sheet: sur- 
face temperature measurements from satellite thermal infrared (Tg) 
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Figure 1| Verification and upper-limit calibration statistics calculated for 
each grid point from the comparison of reconstructed and original satellite- 
derived monthly temperature anomalies. a, Calibration r, 1982—1994.5; 
b, calibration r, 1994.5—2006; ¢, verification r, 1994.5—2006; d, verification r, 
1982-1994.5; e, verification RE, 1994.5—2006; f, verification RE, 
1982-1994.5. Warm colours in e and f (RE scores greater than zero) show 
where results are more accurate than the climatological mean temperature. 
Mean grid-point verification results are RE = 0.11, CE = 0.09 and r= 0.46. 
Crosses show locations of occupied weather stations. 
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observations®, and up-to-date automatic weather station (AWS) mea- 
surements of near-surface air temperature. We use a method”!” 
adapted from the regularized expectation maximization algorithm” 
(RegEM) for estimating missing data points in climate fields. ReEEM 
is an iterative algorithm similar to principal-component analysis, used 
as a data-adaptive optimization of statistical weights for the weather 
station data. Unlike simple distance-weighting”® or similar’ calcula- 
tions, application of RegEM takes into account temporal changes in 
the spatial covariance pattern, which depend on the relative impor- 
tance of differing influences on Antarctic temperature at a given time. 
Furthermore, the iterative nature of RegEM allows it to be used with 
discontinuous time series, permitting us to take full advantage of the 
data available from occupied weather stations. We assess reconstruc- 
tion skill using reduction-of-error (RE) and coefficient-of-efficiency 
(CE) scores as well as conventional correlation (r) scores. Such veri- 
fication metrics are lacking in previous Antarctic temperature 
reconstructions’, but are required for demonstrating skill relative 
to the climatological mean and are therefore critical for confidence 
in the calculation of temporal trends’”. Skill metrics for our Tip-based 
reconstruction from split calibration and verification experiments are 
significant (>99% confidence) at all grid points except in some 
restricted areas, mostly on the eastern side of the Antarctic 
Peninsula (Fig. 1). 

Results from our AWS-based reconstruction agree well with those 
from the Typ data (Fig. 2). This is important because the infrared data 
are strictly a measure of clear-sky temperature® and because surface 
temperature differs from air temperature 2-3 m above the surface, as 
measured at occupied stations or at AWSs. Trends in cloudiness or in 
the strength of the near-surface inversion could both produce spuri- 
ous trends in the temperature reconstruction. The agreement 
between the reconstructions, however, rules out either potential bias 
as significant. Furthermore, detrending of the Tip data before recon- 
struction demonstrates that the results do not depend strongly on 
trends in said data (Supplementary Information). 
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Figure 2 | Reconstructed annual mean Antarctic temperature anomalies, 
January 1957 to December 2006. a, East Antarctica; b, West Antarctica. 
Solid black lines show results from reconstruction using infrared satellite 
data, averaged over all grid points for each region. Dashed lines show the 
average of reconstructed AWS data in each region. Straight red lines show 
average trends of the Tyg reconstruction. Verification results for the 
continental mean of the Tyg reconstruction are RE = 0.34, CE = 0.31 and 
r= 0.73. Grey shading, 95% confidence limits. 
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Our reconstructions show more significant temperature change in 
Antarctica (Fig. 2), and a different pattern for that change than 
reported in some previous reconstructions” (Fig. 3). We find that 
West Antarctica warmed between 1957 and 2006 at a rate of 
0.17 + 0.06 °C per decade (95% confidence interval). Thus, the area 
of warming is much larger than the region of the Antarctic Peninsula. 
The peninsula warming averages 0.11 + 0.04 °C per decade. We also 
find significant warming in East Antarctica at 0.10+0.07°C per 
decade (1957-2006). The continent-wide trend is 0.12 + 0.07 °C per 
decade. In the reconstruction based on detrended Tip data, warming in 
West Antarctica remains significant at greater than 99% confidence, 
and the continent-wide mean trend remains at 0.08 °C per decade, 
although it is no longer demonstrably different from zero (95% con- 
fidence). This is in good agreement with ref. 6, which reported average 
continent-wide warming of 0.082 °C per decade (1962-2003) and 
shows overall warming in West Antarctica, although statistical signifi- 
cance could not be demonstrated owing to the shorter length and 
greater variance of the reconstruction. We emphasize that, in general, 
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Figure 3 | Spatial pattern of temperature trends (degrees Celsius per 
decade) from reconstruction using infrared (T,p) satellite data. a, Mean 
annual trends for 1957—2006; b, Mean annual trends for 1969-2000, to 
facilitate comparison with ref. 2. c-f, Seasonal trends for 1957-2006: winter 
June, July, August; c); spring (September, October, November; d); summer 
(December, January, February; e); autumn (March, April, May; f). Black 
lines enclose those areas that have statistically significant trends at 95% 
confidence (two-tailed t-test). Where it would otherwise be unclear, NS (not 
significant) refers to areas of insignificant trends. Red circles and adjacent 
numbers in a show the locations of the South Pole and Vostok weather 
stations and their respective trends (degrees Celsius per decade) during the 
same time interval as the reconstruction (1957—2006). Black circles in b show 
the locations of Siple and Byrd Stations, and the adjacent numbers show 
their respective trends’ for 1979-1997. 
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detrending of predictand data lowers the quality of reconstructions by 
removing spatial covariance information”®. The detrended reconstruc- 
tion therefore represents a conservative lower bound on trend mag- 
nitude. Although ref. 7 concluded that recent temperature trends in 
West Antarctica are statistically insignificant, the results were strongly 
influenced by the paucity of data from that region. When the complete 
set of West Antarctic AWS data is included, the trends become positive 
and statistically significant, in excellent agreement with our results’. 

Independent data provide additional evidence that warming has 
been significant in West Antarctica. At Siple Station (76° S, 84° W) 
and Byrd Station (80° S, 120° W), short intervals of data from AWSs 
were spliced with 37-GHz (microwave) satellite observations, which 
are not affected by clouds, to obtain continuous records from 1979 to 
1997 (ref. 13). The results show mean trends of 1.1 + 0.8 °C per decade 
and 0.45 + 1.3°C per decade at Siple and Byrd, respectively’*. Our 
reconstruction yields 0.29 + 0.26 °C per decade and 0.36 + 0.37 °C 
per decade over the same interval. In our full 50-year reconstruction, 
the trends are significant, although smaller, at both Byrd 
(0.23 + 0.09 °C per decade) and Siple (0.18 + 0.06°C per decade). 
Furthermore, the seasonal characteristics of these data’’ agree well 
with those from our reconstructions, with the greatest amount of 
warming in austral spring and winter (Fig. 3). Independent analyses 
of tropospheric temperature trends have also found spring and winter 
warming to be greatest in West Antarctica’*”. 

The spatial and seasonal characteristics of our temperature recon- 
struction have important implications for understanding recent 
Antarctic climate change. Several studies have emphasized a warming- 
peninsula, cooling-continent pattern that is attributed to changes in 
atmospheric circulation associated with the southern annular mode 
(SAM)*'®. Cooling over much of East Antarctica did occur in recent 
decades, but was strongest during the short time interval considered in 
earlier studies (1969-2000; Fig. 3b). Virtually all areas warmed 
between 1957 and ~1980. Our reconstruction differs from the results 
of modelling experiments that tie Antarctic surface temperature 
change to stratospheric ozone loss through changes in the SAM'*"’. 
In such simulations, the largest negative temperature anomalies in 
East Antarctica occur in summer, whereas in our reconstruction, 
East Antarctic cooling is restricted to autumn (Fig. 3). The simulations 
show warming in austral summer and autumn, restricted to the 
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peninsula, whereas in our reconstruction the greatest warming is in 
winter and spring, and in continental West Antarctica as well as on the 
peninsula. 

The well-known increases in temperature on the Antarctic Peninsula 
are strongly associated with changes in sea ice’’. Similarly, negative 
anomalies in sea-ice extent”° and the length of the sea-ice season” in 
the Amundsen-Bellingshausen Sea may be related to the warming 
trends we observe in adjacent West Antarctica. To explore this, we 
examined model output from the NASA Goddard Institute for Space 
Studies (GISS) ModelE atmosphere-only and coupled general circula- 
tion models, which were run with multiple oceanic and atmospheric 
boundary conditions until the end of 2003 (ref. 22). A slightly earlier 
atmospheric version of GISS ModelE has been used in simulations of 
circulation anomalies associated with polar stratospheric ozone 
depletion’’. When driven by observed sea-surface-temperature (SST) 
and sea-ice boundary conditions”’, the model reproduces many of the 
basic features of our reconstruction, with warming over most of the 
continent and persistent in West Antarctica (Fig. 4). SST and sea-ice 
changes alone produced weak cooling over parts of East Antarctica 
during the 1980s and 1990s. The details of the comparisons obviously 
depend on the accuracy of the SST and sea-ice observations (the latter 
are not generally considered reliable before 1979), and multi-decadal 
internal variability in the model is substantial. However, it is note- 
worthy that both in the reconstruction and in the model results, the 
rate of warming is greater in continental West Antarctica, particularly 
in spring and winter, than either on the peninsula or in East Antarctica. 
In GISS ModelE, this is related to SST changes and the location of sea- 
ice anomalies, particularly during the latter period (1979-2003), when 
they are strongly zonally asymmetric, with significant losses in the West 
Antarctic sector but small gains around the rest of the continent 
(Fig. 4e). Radiative forcings alone are inadequate to account for the 
observations (Supplementary Information). 

The net impact of SST, sea ice, and radiative forcings on Antarctic 
temperatures in GISS ModelE is in general agreement with our recon- 
struction. The same model, when run in coupled mode (that is, with a 
dynamic ocean) fails to reproduce the strong trends observed in West 
Antarctica and the peninsula. The probable cause of this discrepancy, 
common to other coupled models”, is inadequate representation of 
sea-ice anomalies and their associated higher-order modes of 
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Figure 4 | Comparison of reconstructed and modelled mean annual 
temperature trends (degrees Celsius per decade) for the periods 
1957-1981 and 1979-2003. a—d, 1957-1981; e-h, 1979-2003. 

a, e, Reconstructed surface temperature and observed 25-year change in sea- 
ice fractional area’’. b, f, Surface air temperature from five-member GISS 
ModelE atmosphere-only ensemble simulations with observed sea-surface- 
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temperature and sea-ice boundary conditions. ¢, g, Four-member ensemble 
with the same boundary conditions plus atmospheric forcings (changes in 
atmospheric concentrations of radiatively active species, including ozone). 
d, h, Difference between simulations with the same forcings but observed 
versus climatological sea ice, to isolate the effect of sea ice alone. 
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atmospheric circulation. In this context, it is important that the pattern 
of observed temperature trends closely resembles the pattern of tem- 
perature anomalies associated with the zonal wave-3 pattern in atmo- 
spheric circulation*. This circulation regime is efficient for the 
exchange of air between the ocean and the Antarctic continental inter- 
ior, and is associated with atmospheric circulation anomalies in the 
Amundsen-Bellingshausen Sea, known to precede winter sea-ice 
anomalies”. Forced coupled models, including GISS ModelE, 
generally show a positive shift in the SAM and an associated increase 
in the circumpolar westerlies over recent decades, in good agreement 
with observations’®. Observations also suggest a bias towards the 
positive phase in the wave-3 pattern” since about 1979, which is not 
reproduced in the coupled models. Using observed SST and sea ice, 
GISS ModelE does produce substantial shifts in the wave-3 circulation. 
Under those model conditions, greater cyclonic flow in the Amundsen 
Sea region brings warm, moist air to West Antarctica, countering the 
effect of the enhanced circumpolar westerlies. 

An outstanding question in Antarctic climatology has been 
whether the strong warming of the peninsula has also occurred in 
continental West Antarctica’’. Our results indicate that this is indeed 
the case, at least over the last 50 years. Moreover, ice-core analyses 
indicate average warming of West Antarctica over the entire twen- 
tieth century”. Although the influence of ozone-related changes in 
the SAM has been emphasized in recent studies of Antarctic temper- 
ature trends, the spatial and seasonal patterns of the observed tem- 
perature trends indicate that higher-order modes of atmospheric 
circulation, associated with regional sea-ice changes, have had a lar- 
ger role in West Antarctica. 

Mean surface temperature trends in both West and East Antarctica 
are positive for 1957-2006, and the mean continental warming is 
comparable to that for the Southern Hemisphere as a whole”. This 
warming trend is difficult to explain without the radiative forcing 
associated with increasing greenhouse-gas concentrations. However, 
the future trajectory of Antarctic temperature change also depends on 
the extent to which changes in atmospheric composition (whether 
from greenhouse gases or stratospheric ozone) affect Southern 
Hemisphere sea ice and regional atmospheric circulation patterns. 
Improved representation in models of coupled atmosphere/sea-ice 
dynamics will be critical for forecasting Antarctic temperature change. 


METHODS SUMMARY 


We use near-surface air temperature data from 42 occupied stations and 65 AWSs 
from the READER (Reference Antarctic Data for Environmental Research) data 
set’. We use passive infrared brightness measurements (Tj) of surface temper- 
ature from the Advanced Very High Resolution Radiometer’, a satellite of the US 
National Oceanic and Atmospheric Administration. We use the RegEM 
algorithm”! to combine the data from occupied weather stations with the Tip 
and AWS data in separate reconstructions of the near-surface Antarctic temper- 
ature field. Split calibration/verification tests are performed by withholding pre- 
and post-1995 T;, and AWS data in separate RegEM calculations. Calibration and 
verification statistics are calculated for each grid point from the comparison of the 
reconstructed time series and the original temperature time series. We show RE 
and correlation rvalues in Fig. 1. CE verification values yield results indistinguish- 
able from RE in our study and are reported in Supplementary Information. 
Significance levels of the calibration/verification statistics are based on Monte 
Carlo simulations of red noise as the null hypothesis. In Fig. 2, the 95% confidence 
interval is the unexplained variance, 20, where 02,,4, = Odata(1 — Teer)» Tata is the 
temporal variance in the original satellite temperature data and r2,, is the veri- 
fication fractional resolved variance. Significance levels of trends are calculated 
using a two-tailed t-test, with the number of degrees of freedom adjusted for 
autocorrelation. In reporting trends for different areas, we define West 
Antarctica as 72°—90° S, 60°—180° W; East Antarctica as 65°—90° S, 300—180° E; 
and the Antarctic Peninsula as westerly longitudes north of 72° S. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 
Data. We use the READER weather station temperatures from the British 
Antarctic Survey'. Twenty-seven of the 42 occupied stations have at least 50%- 
complete monthly average data from 1957 to present. Data from 65 AWSs are 
available, but are discontinuous and date from 1980 at the earliest. In addition, 
data from only 24 of the AWSs are more than 50% complete for 1980-2006. We 
use passive infrared brightness measurements (Tip) from the Advanced Very 
High Resolution Radiometer, which are continuous beginning January 1982 and 
constitute the most spatially complete Antarctic temperature data set. The Tig 
data are biased towards clear-sky conditions, owing to the opacity of clouds in 
the infrared band. Cloud masking is probably the largest source of error in the 
retrieval of Ty data from raw satellite spectral information. We have updated the 
data throughout 2006, using an enhanced cloud-masking technique to give 
better fidelity with existing occupied and automatic weather station data. We 
make use of the cloud masking in ref. 8 but impose an additional restriction that 
requires that daily anomalies be within a threshold of +10 °C of climatology, a 
conservative technique that will tend to damp extreme values and, hence, mini- 
mize trends”. Values that fall outside the threshold are removed. 
Calculations. We use the RegEM algorithm"’, developed for sparse data infilling, 
to combine the occupied weather station data with the T;, and AWS data in 
separate reconstructions of the Antarctic temperature field. RegEM uses an 
iterative calculation that converges on reconstructed fields that are most con- 
sistent with the covariance information present both in the predictor data (in this 
case the weather stations) and the predictand data (the satellite observations or 
AWS data). We use an adaptation of RegEM in which only a small number, k, of 
significant eigenvectors are used'°. Additionally, we use a truncated total-least- 
squares (TTLS) calculation®’ that minimizes both the vector b and the matrix A 
in the linear regression model Ax = b. (In this case A is the space-time data 
matrix, b is the principal component time series to be reconstructed and x 
represents the statistical weights.) Using RegEM with TTLS provides more 
robust results for climate field reconstruction than the ridge-regression method 
originally suggested in ref. 11 for data infilling problems, when there are large 
differences in data availability between the calibration and reconstruction inter- 
vals'°. For completeness, we compare results from RegEM with those from 
conventional principal-component analysis (Supplementary Information). 
Monthly average surface temperature anomalies were obtained from the Tir 
data for the domain covering all land areas and ice shelves on the Antarctic 
continent, at 50 km X 50km resolution*. The monthly anomalies are efficiently 
characterized by a small number of spatial weighting patterns and corresponding 
time series (principal components) that describe the varying contribution of each 
pattern. The results are reproducible using single-season, annual average and 
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split-decade-length subsets of the data*. The first three principal components 
are statistically separable and can be meaningfully related to important dynamical 
features of high-latitude Southern Hemisphere atmospheric circulation, as 
defined independently by extrapolar instrumental data. The first principal com- 
ponent is significantly correlated with the SAM index (the first principal com- 
ponent of sea-level-pressure or 500-hPa geopotential heights for 20° S—90° S), and 
the second principal component reflects the zonal wave-3 pattern, which con- 
tributes to the Antarctic dipole pattern of sea-ice anomalies in the Ross Sea and 
Weddell Sea sectors**. The first two principal components of Ti alone explain 
>50% of the monthly and annual temperature variabilities’. Monthly anomalies 
from microwave data (not affected by clouds) yield virtually identical results’. 

Principal component analysis of the weather station data produces results 
similar to those of the satellite data analysis, yielding three separable principal 
components. We therefore used the RegEM algorithm with a cut-off parameter 
k= 3. A disadvantage of excluding higher-order terms (k > 3) is that this fails to 
fully capture the variance in the Antarctic Peninsula region. We accept this trade- 
off because the Peninsula is already the best-observed region of the Antarctic. 
Statistics. We obtained calibration/verification statistics by withholding the first 
and last 12.5 years of the 25-year Tir data in separate RegEM calculations. We 
similarly split the AWS data into pre- and post-1995 data. Confidence levels are 
based on Monte Carlo simulations of red noise as the null hypothesis. For each grid 
point, 1,000 red noise series were generated to have the same mean, variance and 
lag-1 autocorrelation coefficient as the actual time series over the calibration period. 
Additional validation of the Tjp-based reconstruction was obtained by using the 15 
occupied weather stations with the most complete data, reserving the other 27 for 
verification. Verification metrics at these sites are consistently significant at >99% 
confidence, with the exception of some sites at the tip of the Antarctic Peninsula and 
the three sites north of 55° S (Supplementary Information). 

We report verification statistics as well as upper-bound calibration-interval 
statistics. The latter represent the maximum level of explained variance that 
could be expected in the reconstruction, given how much data variance is 
resolved over the calibration interval. We rely primarily on the RE statistic; 
the alternative verification statistic, CE, yields indistinguishable results in our 
study (Supplementary Information). For completeness, we also report correla- 
tion r values, but with the recognition that ris a deficient skill metric because it 
does not penalize the poor prediction of either means or variances’®. 
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A simple model of bipartite cooperation for ecological 
and organizational networks 


Serguei Saavedra’, Felix Reed-Tsochas** & Brian Uzzi>*® 


In theoretical ecology, simple stochastic models that satisfy two basic 
conditions about the distribution of niche values and feeding ranges 
have proved successful in reproducing the overall structural 
properties of real food webs, using species richness and connectance 
as the only input parameters’ ~*. Recently, more detailed models have 
incorporated higher levels of constraint in order to reproduce the 
actual links observed in real food webs”*. Here, building on previous 
stochastic models of consumer-—resource interactions between 
species’ °, we propose a highly parsimonious model that can repro- 
duce the overall bipartite structure of cooperative partner—partner 
interactions, as exemplified by plant—animal mutualistic networks’. 
Our stochastic model of bipartite cooperation uses simple speciali- 
zation and interaction rules, and only requires three empirical input 
parameters. We test the bipartite cooperation model on ten large 
pollination data sets that have been compiled in the literature, and 
find that it successfully replicates the degree distribution, nestedness 
and modularity of the empirical networks. These properties are 
regarded as key to understanding cooperation in mutualistic 
networks*"°. We also apply our model to an extensive data set of 
two classes of company engaged in joint production in the garment 
industry. Using the same metrics, we find that the network of 
manufacturer—contractor interactions exhibits similar structural 
patterns to plant—animal pollination networks. This surprising 
correspondence between ecological and organizational networks 
suggests that the simple rules of cooperation that generate bipartite 
networks may be generic, and could prove relevant in many different 
domains, ranging from biological systems to human society''™*. 

In ecology, the collection and analysis of empirical data on mutualistic 
networks’ can help identify the most significant features that have 
evolved in bipartite cooperative networks. In these networks, species in 
one class (class A, for animals) cooperate with species in a second class 
(class P, for plants) to mutual advantage. Species in class P offer rewards 
with certain characteristics determined by reward traits, which may also 
have evolved to reduce exploitation and favour mutualism”». Species in 
class A foraging for resources can benefit from the rewards offered by a 
given species in class P if the respective foraging traits (for example 
efficiency, morphology and behaviour) and reward traits (for example 
quantity, quality and availability) are complementary*"®. External factors 
such as the environmental context (for example population density and 
geographic and temporal variation) attenuate or amplify the value of 
reward and foraging traits, and have an impact on the number of poten- 
tial partners with which a given species cooperates'”’. Furthermore, 
mutualistic networks exhibit constraints introduced by phylogenetic 
relationships between species in the same class, which impact mutualistic 
interaction patterns by favouring ecological similarity”. 

Recently, mutualistic models*’”* have been proposed which incor- 
porate the effects of complementarity and exploitation or growth bar- 
riers on interactions between species in classes A and P (Supplementary 


Information). Although a mixed model”' and a differential limiting 
size model” have proved successful at generating structural patterns 
similar to those observed in real mutualistic networks, we find that they 
are able to reproduce less than 25% of the total number of observed 
metrics in different ecological networks (Supplementary Tables 1-4). 

In organizational networks, cooperation between two different 
classes of company (manufacturers and contractors) is subject to 
equivalent structural constraints, which depend on the traits of the 
companies and the complementarity between traits of potential part- 
ners, as well as hierarchical relationships between companies in the 
same class”. Companies are characterized by a set of organizational 
traits’ (for example company size, competitive niche space and 
brand positioning), and face interaction barriers generated by differ- 
ences in status, which limit the number and range of potential 
partners. Again, the values associated with trait complementarity 
and interaction barriers are not absolute, but are modulated by the 
specific market context in which the companies operate**”’. 

Inspired by previous food-web models’, we develop a new model 
of bipartite cooperation that can reproduce more than 70% of the 
total number of observed metrics in both ecological and organiza- 
tional networks (Table 1, Supplementary Tables 3, 4). In the corres- 
ponding ecological (species-based) and organizational (company- 
based) contexts, plants and manufacturers are treated as members 
of class P, and animals and contractors as members of class A. The 
three inputs for the model are the size of class A, the size of class Pand 
the total number of links, L, all of which are given directly by the 
empirical data (Table 1). The model consists of two mechanisms (see 
Methods for a description of the model). 

The first mechanism is specialization. The specialization rule deter- 
mines how many partners, J,, each member p € P will cooperate with. 
This number is determined by the reward value associated with p € P, 
which is given by the reward trait, fp, attenuated or amplified by an 
external factor 2, that accounts for effects such as geographic variation 
and population diversity. Higher reward values increase the number of 
potential interactions. Reward traits fg, are the result of a hierarchical 
process, which in our model corresponds to the generation of an 
ordered sequence in trait space, so fg, has a role equivalent to that of 
the niche value in the niche model' or nested-hierarchy model’. Note 
that the specialization rule is only associated with class P. This is con- 
sistent with previous findings which show that external factors affect 
the level of specialization more strongly in plants than in animals"*. 

The second mechanism is interaction. The interaction rule deter- 
mines which members a € A cooperate with each member p € P. 
Here interactions are limited by the complementarity between the 
reward traits, fr», for p © Pand foraging traits (organizational traits), 
tea for a € A. The foraging trait fp, limits the range of possible 
partners for each member of class A, but again external factors 1jp 
such as temporal variation and population density can modify these 
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Table 1| Empirical values and model statistical significance 


Data set/environment L |P| |A| KSp-KS, N Q 
Marsh (Japan) 430 64 187 0.326*-0.438* 0.976* (0.969) 0.551* (0.553) 
Grassland (Cass, New Zealand) 374 Al 139 0.633*-0.385* 0.957* (0.960) 0.474+ (0.465) 
Subalpine forest/meadow (Japan) 865 90 354 0.552*-0.0018 0.9857 (0.976) 0.545+ (0.532) 
Subalpine (Arthur's Pass, New Zealand) 120 18 60 0.108+-0.999* 0.8588 (0.936) 0.553t (0.527) 
Subalpine (Craigieburn, New Zealand) 346 49 118 0.0028-0.0018 0.961* (0.955) 0.4807 (0.468) 
Tundra (Canada) 79. 29 81 0.097+-0.989* 0.971+ (0.950) NM 
Scrub/snow gum forest (Australia) 252 36 81 0.608*-0.076+ 0.935* (0.949) NM 
Deciduous forest (USA) 65 7 33 0.911*-0.642* 0.953+ (0.930) NM 
Arctic tundra (Greenland) 453 31 75 0.038t-0.118+ 0.7938 (0.914) NM 
Subarctic rock slope (Sweden) 242 24 118 0.223+-0.005t 0.927+ (0.952) NM 


New York garment industry, 1985 
New York garment industry, 1991 
New York garment industry, 1997 
New York garment industry, 2003 228 62 128 


0.0614-0.115+ 
0.1017-0.531i 


0.997+ (0.996) 
0.994* (0.993) 
0.003}-0.264+ 0.990+ (0.988) 0.653¢ (0.625) 
0.370+-0.002¢ 0.976+ (0.969) 0.711+ (0.700) 


For each pollination data set and the four organizational networks used in this paper, the table presents its environment/location; the total number of links, L; the respective numbers of nodes, |P| 
and |A |, in classes P and A; the combined Kolmogorov-Smirnov (KS) probabilities, KSp-KS,, calculated for the degree distributions using the two-group equivalence KS test between the empirical 
and model-generated distributions for classes P and A, respectively; the observed nestedness, N; and the observed mean modularity value, Q. Note that all networks have a ratio |P|/|A| <0.5, 
which has been found to be an important factor limiting the appearance of scale-free distributions**. The model-generated mean values for N and Q are shown inside parentheses. Five of the observed 
pollination networks have already been found to be non-modular (NM)"°. All comparisons are based on 1,000 model simulations. Note that the model reproduces more than 70% of the overall 
number of observed metrics with a good or excellent fit (27 out of 35 and 11 out of 16 for the ecological and organizational networks, respectively). 

* KS = 0.30, normalized errors less than one model s.d. (excellent fit). KS < 0.30, normalized errors between one and two model s.d. (good fit). [KS < 0.05, normalized errors between two and 
three model s.d. (poor fit). 8KS < 0.01, normalized errors greater than three model s.d. (bad fit). 


0.5988 (0.502) 
0.6018 (0.529) 


interaction barriers. In both the specialization and interaction rules, 
we assume that traits are normally distributed and that the effect of 
external factors on members is inhomogeneous and a function of the 
diversity of members and the density of interactions in the network. 

Wetest our model using data from real-world ecological and organi- 
zational networks. For the ecological networks, we use a diverse set of 
ten extensive plant—animal pollination networks compiled in the lite- 
rature (Supplementary Information), which can clearly be distin- 
guished from random assemblages’. For organizational networks, we 


use a data set of approximately 700,000 yearly bilateral production 
transactions between more than 10,000 manufacturers and contractors 
in the New York garment industry (NYGI) from January 1985 to 
December 2003 (Supplementary Information)*’. The NYGI exhibits 
a significant turnover of companies each year with different declining 
trajectories for each class (Supplementary Fig. 1), so the network struc- 
ture at one time does not trivially map into the structure at other times. 
For each network, we investigate three key features of bipartite 
cooperation*””: (1) degree distribution”, (2) nestedness’””** and (3) 


modularity”. 


2 1 First we consider the degree distribution. In Fig. la and b we 
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Figure 1| Scaled degree distribution. a, The cumulative degree 
distribution, P.m(k), for members of class P (plants, manufacturers); 

b, Pcum(k) for members of class A (animals, contractors). The number of 
partners, k, is scaled by a multiplicative factor of 1/zp for members of P and 
1/z, for members of A, where zp = L/P and z, = L/A. Filled symbols 
correspond to pollination networks and open symbols to organizational 
networks. Note that all distributions collapse into a single curve. The solid 
line corresponds to the model-generated distributions averaged over 1,000 
simulations (Supplementary Fig. 2). 


0.92 LX 
0.92 0.93 0.94 0.95 0.96 0.97 0.98 0.99 1 
Observed 


Figure 2 | Nestedness. a, b, Comparisons of the randomly generated (green 
crosses) and model-generated (orange circles) nestedness values 

(mean + 2s.d.) with the observed nestedness values for all the pollination 
(a) and the NYGI (b) networks. The dashed lines correspond to there being 
perfect agreement between predicted and observed values. Note that a 
matrix with a nestedness value of one is perfectly nested. 
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and members of class A. Note that pollination networks (filled sym- 
bols) and organizational networks (open symbols) exhibit the same 
patterns in both degree distributions. The solid line corresponds to the 
model-generated degree distributions (see also Supplementary Fig. 2). 
Table 1 provides statistical evidence for the correspondence between 
the empirical and model-generated distributions for both ecological 
and organizational networks. 

To calculate the nestedness level, N, of the empirical and model- 
generated networks, we use the BINMATNEST program’. Here 
nestedness is defined in the interval [0,1], where 1 corresponds to 
a perfectly nested network’. Figure 2 shows the nestedness values for 
empirical networks (dashed line), random assemblages generated 
following the null model II proposed in ref. 9 (green crosses) and 
model-generated networks (orange circles). Note that the model 
always significantly outperforms random assemblages. Table 1 pre- 
sents tests of statistical significance and shows that there is a high level 
of correspondence between the model and data for most of the net- 
works (Supplementary Fig. 3). 

Modularity values, Q, for the networks are calculated using the one- 
mode optimization algorithm proposed in ref. 30, because we want to 
extract only cooperative units from the network. This algorithm has 
shown that only five of the observed ecological networks are truly 
modular’? (Table 1). For these five networks, we find a strong corres- 
pondence between the empirical and model-generated modularity 
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values (Table 1). For each organizational network, we find a modu- 
larity value that is higher than that generated from the corresponding 
random assemblages*” (P< 107°), and this empirical behaviour is 
replicated by the model (Table 1). Nodes have been shown to have 
different connectivity roles (for example global or local hubs) depend- 
ing on how they are embedded within their module and participate in 
other modules*’. Here we also test the ability of our model to accur- 
ately reproduce the empirically observed number of nodes within each 
connectivity role*’ (Fig. 3). To do so, we measure the Pearson correla- 
tion coefficient, r, and the ratio of the connectivity role norms, d, for 
the observed and model-generated networks (Methods). We consis- 
tently find values aligned with the empirical measurements for the 
ecological and organizational networks (r= 0.98, 0.9<d< 1.1). 
Our study identifies striking similarities in the general structural 
characteristics of networks that are formed as a result of cooperative 
mechanisms operating in radically different contexts, linking part- 
ners in ecological and socio-economic systems, respectively. This 
empirical finding motivates the proposed simple model for bipartite 
cooperation, which captures the most important generic features of 
mutualistic interaction patterns starting from a minimal set of input 
parameters. At the level of partner—partner interactions, equivalent 
behaviour in different systems appears to be driven by similar types of 
interaction constraints. These correspond to complementarity in 
traits or characteristics, a hierarchical organization limiting the range 
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Figure 3 | Connectivity roles. a, b, Division of connectivity roles (R1—R7) 
for members of the subalpine (Japan) network (a) and the 1997 NYGI 
network (b). Plants and manufacturers are indicated with blue circles and 
animals and contractors with red triangles. Nodes with a normalized within- 
module degree (z score) z= 2.5 have been heuristically defined as module 
hubs, and nodes with z < 2.5 as non-hub nodes”’. In addition, hubs and non- 
hub nodes are classified according to a participation coefficient, P., which 
gives the level of interaction of a node with the rest of the modules. The 
complete classification is as follows. For non-hub nodes, R1 comprises nodes 
with all their links connected within their own module (P. = 0.05), R2 
comprises nodes with most of their links connected within their own module 
(0.05 < P, = 0.62), R3 comprises nodes with many links connected to other 
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modules (0.62 < P, = 0.80) and R4 comprises nodes with their links 
homogeneously connected to all other modules (P. > 0.80). For hub-nodes, 
R5 comprises nodes with most of their links connected to their own module 
(P, = 0.30), R6 comprises nodes with most of their links connected to other 
modules (0.30 < P.=0.75) and R7 comprises nodes with their links 
homogeneously connected to all other modules (P. > 0.75).¢, d, Proportions 
of nodes from the population within each connectivity role for the modular 
pollination networks (c) and the NYGI networks (d). Points (various 
symbols) correspond to the empirical values and bars (mean = s.d.) 
correspond to the model-generated values. Note that the model accurately 
replicates the proportion of nodes within each connectivity role for both 
types of network. 
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of potential partners, and the environmental context. The success of 
this simple stochastic model in generating the overall structural char- 
acteristics of mutualistic networks makes it a suitable starting point 
for the development of more elaborate ecological models, with the 
aim of addressing the important question of reproducing the actual 
links observed in real mutualistic webs. Such an approach would 
require more comprehensive statistical comparisons across different 
network metrics using maximum-likelihood methods®. Beyond this 
ecological context, and the specific organizational network that has 
been considered, the generic nature of the bipartite cooperation 
model and its underlying assembly rules suggests that it should be 
relevant to many other cooperative networks in domains beyond 
those considered here. 


METHODS SUMMARY 

Model. For the specialization rule, the number of links, /,, for each node p € Pis 
defined by Ip=1+Round((L—|P|)tppAp/ )__, trjAj), where the reward trait tr, 
is uniformly drawn from [0, 1], the external factor Ap is randomly drawn from an 
exponential distribution, |*| denotes set cardinality and Round(-) is the nearest- 
integer function. 

For the interaction rule, nodes from class P are sorted according to their 
reward trait tg, in ascending order. Nodes from class A are sorted in descending 
order according to their foraging traits t;,, which are uniformly drawn from 
[0, 1]. Starting from the first node, p; and continuing sequentially subject to 
trpi> Aip» each link |; is connected to the first node a’ € A’, where A’ is the 
subset of nodes in A that have not already been linked to by another node p ¥ pj. 
Here jp; is an external factor drawn randomly from the same exponential dis- 
tribution as /,;. If tap; = /1p; then the link is randomly connected to another node 
a'' © A'', where A”’ is the subset of nodes in A that have been linked in a 
previous time step. If the supply of nodes in either A’ or A’’ is exhausted before 
all J,; links have been allocated, then nodes in the other subset are linked to 
instead. The model is initialized by connecting the first node, p, to [,;nodes in A’. 

The exponential distribution used for i, and Ap, is of the form 

p(x) = Bexp(—fx), with fB = |P\(|A| — 1)/(2(L— |P|)) — 1. If we generalize our 
formalism and use a beta distribution’’, we do not find significantly different 
results (Supplementary Table 5). 
Ratio of connectivity role norms. The ratio of the norms, d, is defined by 
d=|x|/|y|, where |x| =(S0", x7)", |y| =(92), v2)", m is the number of 
connectivity roles and x; and y; are the numbers of nodes within each role i for 
the empirical and, respectively, model-generated networks. Ratios within 
(0.9, 1.1) are fractions with a norm comparable to the empirical data. 
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Ferritin is used for iron storage in bloom-forming 


marine pennate diatoms 


Adrian Marchetti'*, Micaela S. Parker’*, Lauren P. Moccia’, Ellen O. Lin’, Angele L. Arrieta’, Francois Ribalet’, 
Michael E. P. Murphy, Maria T. Maldonado? & E. Virginia Armbrust’ 


Primary productivity in 30—40% of the world’s oceans is limited by 
availability of the micronutrient iron’”. Regions with chronically 
low iron concentrations are sporadically pulsed with new iron 
inputs by way of dust’ or lateral advection from continental 
margins*. Addition of iron to surface waters in these areas induces 
massive phytoplankton blooms dominated primarily by pennate 
diatoms”®. Here we provide evidence that the bloom-forming 
pennate diatoms Pseudo-nitzschia and Fragilariopsis use the 
iron-concentrating protein, ferritin, to safely store iron. Ferritin 
has not been reported previously in any member of the 
Stramenopiles, a diverse eukaryotic lineage that includes unicel- 
lular algae, macroalgae and plant parasites. Phylogenetic analyses 
suggest that ferritin may have arisen in this small subset of diatoms 
through a lateral gene transfer. The crystal structure and functional 
assays of recombinant ferritin derived from Pseudo-nitzschia 
multiseries reveal a maxi-ferritin that exhibits ferroxidase activity 
and binds iron. The protein is predicted to be targeted to the 
chloroplast to control the distribution and storage of iron for 
proper functioning of the photosynthetic machinery. Abundance 
of Pseudo-nitzschia ferritin transcripts is regulated by iron nutri- 
tional status, and is closely tied to the loss and recovery of 
photosynthetic competence. Enhanced iron storage with ferritin 
allows the oceanic diatom Pseudo-nitzschia granii to undergo 


Figure 1| Tertiary crystal structure and ferroxidase site of Pseudo- 
nitzschia multiseries ferritin. a, b, Crystal structure of the recombinant 

P. multiseries ferritin: a, multimer (24mer); b, monomer. o-helices and 
ferroxidase centre side chains: blue, A helix; orange, B helix; green, C helix; 
yellow, D helix; red, E helix. Iron atom (pink) positions are according to the 
crystal structure. c, Ferroxidase site A is occupied by a water molecule (grey 


several more cell divisions in the absence of iron than the compar- 
ably sized, oceanic centric diatom Thalassiosira oceanica. Ferritin 
in pennate diatoms probably contributes to their success in chronic- 
ally low-iron regions that receive intermittent iron inputs, and 
provides an explanation for the importance of these organisms in 
regulating oceanic CO, over geological timescales”*. 

Ferritin is an iron-storage protein used by plants, animals, cyano- 
bacteria and other microorganisms to safely concentrate and store 
iron, thereby minimizing potential cell damage from reactive oxygen 
species and oxidative stress. Ferritin subunits from eukaryotes 
assemble into nanocages capable of storing up to 4,500 Fe(II) atoms 
as an iron oxide mineral by oxidizing iron at ferroxidase centres and 
releasing reduced iron on cellular demand’. We identified an 
expressed sequence tag (EST) for a gene encoding a ferritin-like pro- 
tein in the pennate diatom Pseudo-nitzschia australis, and subse- 
quently generated full-length sequences for this gene from P. 
australis and P. multiseries and partial sequences from additional 
species including oceanic P. granii. The predicted amino acid 
sequence of Pseudo-nitzschia ferritin displays general conservation 
of residues that form the ferroxidase centres; it also possesses the 
requisite signal peptide and plastid transit peptide motifs’® to target 
ferritin subunits to the plastid (Supplementary Fig. 1), where they are 
expected to both store iron away from reactive oxygen as Fe(1I1) and 


c 


sphere). One iron atom (pink sphere) occupies site B at full occupancy, and 
is coordinated by four glutamate residues (Glu 44, Glu 48, Glu 94 and 
Glu 130) shown as stick models. An iron atom at site C and a third iron atom 
are refined at occupancies of 50% and 40%, respectively. A difference 
anomalous dispersion map around the iron atoms is contoured at 3o. 
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make Fe(11) available for iron-requiring processes, including pho- 
tosynthesis. Residues that line the three-fold channels to facilitate iron 
release from the mineral core’! are either conserved or conserved 
substitutions (Supplementary Fig. 1). 

Characterization of recombinant P. multiseries ferritin (rPmFTN) 
was used to demonstrate that the protein displays features typical of 
maxi-ferritins. The apparent molecular mass of rPmFTIN is 
~530kDa, consistent with formation of a 24-subunit structure 
(Supplementary Fig. 2). As measured by an aerobic oxidative iron 
uptake assay, rPmFTN displayed ferroxidase activity through the 
formation of oxy/hydroxo Fe(1I) species (Supplementary Fig. 3a)'’. 
Under these assay conditions, more than 600 iron atoms were loaded 
per apo-ferritin nanocage. The ferroxidase reaction consumed oxygen 
in a ratio of 1.9 + 0.2 Fe(11):O2 (Supplementary Fig. 3b), consistent 
with the proposed di-ferrous binding of oxygen at ferroxidase sites in 
maxi-ferritins'*’’. Following the consumption of O3, addition of cata- 
lase to the assay solution resulted in stoichiometric regeneration of O2, 
indicating net production of HO, by the ferroxidase reaction as seen 
with many ferritins'’, but not bacterioferritins'* or Dps ferritins’’. 

The crystal structure of rPmFTN was resolved to 1.95A and 
provided further support that the P. multiseries protein functions 
as a ferritin in vivo. The structure revealed an assembly of 24 mono- 
mers with a typical maxi-ferritin arrangement (Fig. 1). The monomer 
fold is most similar (<1.3A root mean squared deviation, 155 
aligned residues) to those of eukaryotic, bacterial and archeal ferri- 
tins, and less similar to bacterioferritins and Dps ferritins. Also, the 
ferroxidase site is conserved. Three iron atoms are observed in its 
vicinity; one iron atom is coordinated by conserved ferroxidase site B 
residues and a fourth ligand (Glu 44), and the other two iron atoms 
are positioned towards the core (Fig. 1c and Supplementary Fig. 4). 
Ferroxidase site A is occupied by a water molecule. 

Iron nutritional status affected growth rates (4) and ferritin tran- 
script abundance for both coastal P. multiseries (North Atlantic) and 
oceanic P. granii (iron-limited northeast Pacific). P. multiseries had a 
reduced wU (L/Lmax = 0.2; Fig. 2a) at a low total iron concentration, 
[Fey], of 6.2 nmoll” ' (see Supplementary Table 1 for other iron species’ 
estimates). In contrast, P. granii maintained near-maximum [Ll 
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Figure 2 | Growth characteristics and ferritin transcript abundance in 
Pseudo-nitzschia. a, Growth rates ({1) of coastal P. multiseries (black filled 
circles) and oceanic P. granii (open circles) as a function of total iron 
concentration, [Fey]. b, Relative transcript abundance of ferritin normalized 
to actin (FTN:ACT) in P. multiseries (black filled squares) and P. granii 
(open squares) as a function of [Fey]. ¢, Maximum photochemical yield of 
photosystem II (F,/F,,: black filled triangles, replicate A; open triangles, 
replicate B) and cell concentrations (black filled circles, replicate A; open 
circles, replicate B) as a function of time after 8 uumol 1”! iron addition to 
iron-limited P. multiseries originally grown at 6.2 nmoll ' [Fe,]. d, Relative 
transcript abundance of FTN:ACT (black filled diamonds, replicate A; open 
diamonds, replicate B) as a function of time after iron resupply. For a and b, 
symbols represent means of biological replicates + s.e.m. (n = 3). For d, 
symbols represent means of technical triplicates + s.d. 


(1/Mmax = 9.93 Fig. 2a) at this [Fey], highlighting the significantly higher 
iron-use efficiencies of oceanic diatoms’®*'’”. Under iron-limiting con- 
ditions, both species displayed lower 1, lower maximum photochemi- 
cal yield of photosystem II (F,/F,,), and lower chlorophyll a content 
(Fig. 2a and Supplementary Fig. 5a). For both species, the lowest 
amounts of ferritin transcripts corresponded to the lowest [Fer] 
(Fig. 2b). Significantly higher ferritin transcript abundances (one- 
way analysis of variance, Tukey test, P< 0.05) were observed at [Fer] 


Figure 3 | Ferritin phylogenetic tree. Bootstrap 
consensus tree (100 replicates) showing the 


Cyanobacteria evolutionary relatedness of maxi-ferritins from 
cee as 26 taxa inferred using a protein distance model 
Bacteria and archaea Prochlorococcus § 4p 
MIT9313 1 (see Methods). Bootstrap values greater than 50 
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that supported ~[max (that is, >42 nmol 1"! for P. multiseries, and 
>12.9nmoll' for P. granii). A [Fer] of 12.9 nmoll* is equivalent 
to estimates of artificial'® and sporadic natural* iron inputs into iron- 
limited surface waters (see Supplementary Note 1). Over an identical 
range of [Fey] (6.2-1,370nmoll~'), oceanic P. granii displayed a 
20-fold range in ferritin transcript abundance, whereas coastal P. multi- 
series displayed a twofold range, further emphasizing the enhanced 
sensitivity of oceanic species to ambient iron concentrations. 

To mimic the sporadic availability of iron observed in nature, iron- 
limited P. multiseries cells were re-supplied with a pulse of iron. 
Coinciding with an immediate increase in F,/F,, (Fig. 2c), ferritin 
transcript abundance increased and peaked within 6h of iron addi- 
tion, decreased to a minimum at 12 h and then gradually returned to 
levels measured in acclimated iron-replete cells (Fig. 2d). As in other 
eukaryotes’’, these results are consistent with Pseudo-nitzschia ferritin 
serving essential roles in both protection from oxidative stress (a rapid 
response to excess iron) and iron storage (a longer term acclimation). 

Our identification of ferritin in pennate diatoms represents the first 
evidence of this iron-concentrating protein among the Stramenopiles, 
a major lineage of eukaryotes (Supplementary Fig. 6). We detected a 
ferritin-like gene in five additional open-ocean Pseudo-nitzschia 
species, in the closely-related Southern Ocean pennate diatom, 
Fragilariopsis cylindrus and in the whole genome sequence of the 
pennate diatom, Phaeodactylum tricornutunr® (Supplementary Table 2 
and Supplementary Fig. 1). Ferritin genes were not detected in 
available whole genome sequences of other Stramenopiles— 
Thalassiosira pseudonana” (centric diatom), Aureococcus anophagefferens 
(Pelagophyte) and two Phytopthora spp. (Oomycetes). Thus, there is 
no current evidence for iron storage by ferritin in centric diatoms or 
other Stramenopiles. 

Ferritins from other photosynthetic eukaryotes (red and green 
algae, plants) are closely related to each other and to those found in 
animals (Fig. 3). In contrast, P. multiseries and P. tricornutum ferritins 
are only 24% identical (excluding predicted targeting peptides) and 
do not cluster with each other or with the other major maxi-ferritin 
branches, indicating that a common ancestor is not present among 
available ferritin sequences. The apparent absence of ferritin in the 
evolutionarily more ancient centric diatoms” in combination with 
large sequence disparities between Phaeodactylum and Pseudo- 
nitzschia suggest that the presence of ferritin in these diatoms may 
reflect lateral gene transfer events”’. 

Consistent with their divergent evolutionary histories, pennate and 
centric diatoms have different iron acquisition” and storage strategies. 
Oceanic diatoms dramatically reduce their intracellular iron require- 
ments under iron-limiting conditions'®”*’*. Under high-iron condi- 
tions, however, oceanic Pseudo-nitzschia spp. store significantly more 
iron than oceanic Thalassiosira spp. (see Supplementary Note 2 and 
Supplementary Fig. 7). A continuous culture approach was implemen- 
ted to determine whether iron storage by ferritin in oceanic P. granii 
provides a competitive advantage over the putative non-ferritin- 
containing oceanic centric diatom T. oceanica. In each experiment, 
cells were pre-acclimated to either high or low iron-replete conditions 
(max Maintained) before transfer to a continuous culture with iron- 
free medium (see Supplementary Note 3). 

High-iron P. granii cells maintained >90% F,/F,, and maximum 
cellular chlorophyll a fluorescence for 5.4 cell divisions and com- 
pleted 9.2 cell divisions before 4 decreased below 80% of fMmax 
(Fig. 4a and Supplementary Fig. 5b). Elevated ferritin transcript 
abundances were maintained for >4.5 cell divisions before rapidly 
declining to a minimum, suggesting a concomitant reduction in 
ferritin-stored iron. The decline in F,/F,, as iron stores are depleted 
probably reflects a reduction in photosynthetic efficiency and re- 
organization of intracellular iron to maintain pmax. Low-iron 
P. granii cells maintained >90% F,/F,, for 1.6 divisions and >80% 
of max for 2.8 divisions; ferritin transcript abundance began to 
decline almost immediately, consistent with minimal iron storage 
by these cells (Fig. 4b). Together, these results indicate that both 
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F,/F,, and ferritin transcript abundance are sensitive to ferritin iron 
stores and that this stored iron directly influences the number of cell 
divisions possible in the absence of added iron. 

High-iron T. oceanica maintained >90% F,/F,, for 3.1 cell divi- 
sions and >80% of {max for 5.3 cell divisions (Fig. 4c), indicating that 
iron stores in this diatom supports significantly fewer divisions than 
comparably grown P. granii. P. granii undergoes about 2.3 additional 
cell divisions before the decline in F,/F,, and an additional 1.6 extra 
cell divisions before yz declines relative to T. oceanica, suggesting that 
approximately 60% (2.3/(2.3 + 1.6) X 100) of the overall additional 
cell divisions by P. granii are due to enhanced iron storage. Assuming 
similar starting cell densities of P. granii and T. oceanica, 4 extra cell 
divisions by P. granii translates into ~16-fold more P. granii cells at 
the end of an iron-induced bloom. Enhanced iron storage in com- 
bination with higher iron-use efficiencies'® explains why oceanic 
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Figure 4 | Growth characteristics and ferritin transcript abundance in the 
oceanic diatoms P. granii and T. oceanica grown in iron-free continuous 
cultures. a, High iron (1,370 nmol] 1) pre-acclimated P. granii; b, low iron 
(12.9 nmol! ') pre-acclimated P. granii; and c, high iron (1,370 nmol V}) 
pre-acclimated T. oceanica. Shown are the 4-h interval growth rates (11, open 
circles), maximum photochemical yield of photosystem II (F,/Fm grey filled 
circles) and relative transcript abundances of ferritin normalized to actin 
(FTN:ACT) (black filled circles; for P. granii only) as a function of cell 
divisions. Dashed lines indicate the dilution rate (set at 80% of {lmax). Light 
grey bars indicate the decline in F,/F,, below 90% of the maximum ratio. 
Dark grey bars indicate the permanent decline in growth rates below the 
dilution rate. Error bars represent the s.d. of technical triplicates. 
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Pseudo-nitzschia commonly achieve cell densities an order of mag- 
nitude greater than oceanic centric species such as T. oceanica during 
iron-fertilization-induced blooms*. 

Identification of ferritin in a subset of pennate diatoms helps elu- 
cidate how some open ocean phytoplankton take advantage of pulsed 
iron supplies by safely sequestering large amounts of iron that sup- 
port subsequent growth and divisions well after iron levels return to 
low, ambient concentrations. The ability to store iron may be crucial 
to achieving ample seed populations that persist between iron input 
events. The dominance of both Pseudo-nitzschia and Fragilariopsis 
during the numerous iron-enrichment experiments performed in 
iron-limited regions around the globe is probably not coincidental 
but rather due in part to high iron storage mediated through ferritin. 
The disparity between organismal and ferritin sequence phylogenies 
suggests that acquisition of ferritin may have been a relatively recent 
event that facilitated radiation of pennate diatoms into the open 
ocean (see Supplementary Note 4). Our results highlight how 
changes in the ocean environment, spanning near-instantaneous to 
geological timescales, shape the diversity and distributions of import- 
ant groups of marine primary producers. 


METHODS SUMMARY 


Full-length FTN cDNA sequence was obtained using RACE. Cultures were grown 
using trace metal clean techniques in Aquil medium buffered with 100 umoll~! of 
EDTA. Acclimated 1 values were estimated from in vivo chlorophyll a fluor- 
escence. Maximum photochemical yields of photosystem II were determined with 
a Phyto-PAM fluorometer (Walz). Cell counts were performed by microscopy 
using a Sedgwich-Rafter slide or with a Cytopeia Influx Cell Sorter flow cytometer. 
Total RNA (1-2 |g) was reverse transcribed using the 1st Strand cDNA Synthesis 
Kit with oligo-dT primers (Invitrogen) and used in qRT—PCRs with an iCycler iQ 
Real-Time PCR Detection System (Bio-Rad Laboratories) and linearized plas- 
mids containing target genes as standards. Evolutionary relatedness of the Pseudo- 
nitzschia ferritin sequences to other ferritins was determined using the protein 
distance model JTT in the PHYLIP software package. The P. multiseries FTN 
genomic DNA lacking the signal peptide and plastid targeting sequences was 
overexpressed in Escherichia coli and rPmFTN was purified by anion exchange 
chromatography. Purity and molecular weight of rPmFTN monomer were deter- 
mined by SDS-PAGE and the size of purified rPmFTN was determined with a 
calibrated gel filtration column. Ferroxidase activity was assayed spectroscopically 
by monitoring the increase in absorbance during aerobic titration of a 
0.76 umol!! solution of purified rPmFTN (in 50 mmoll7! MES, 100 mmol17! 
NaCl, pH 6.5) with ferrous sulphate solution in ~8 Fe equivalents up to ~ 1,000 
Fe(1):ferritin. Oxygen consumption was monitored using an Apollo 4000 Free 
Radical Analyser fitted with a Clark type electrode. Crystals of rPmFTN were 
obtained from 1.4-1.6 moll” ' ammonium sulphate and 0.1 moll~' ammonium 
acetate buffer pH 5.5. X-ray data were collected from iron soaked and apo-ferritin 
crystals and the structure was solved by molecular replacement using the 
Pyrococcus furiosus ferritin (PDB ID 2JD6) as a model. Anomalous dispersion 
data were used to identify iron atoms in the structure. 
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Discovery of a sexual cycle in the opportunistic fungal 
pathogen Aspergillus fumigatus 


Céline M. O'Gorman’”, Hubert T. Fuller’ & Paul S. Dyer? 


Aspergillus fumigatus is a saprotrophic fungus whose spores are 
ubiquitous in the atmosphere’. It is also an opportunistic human 
pathogen in immunocompromised individuals, causing poten- 
tially lethal invasive infections**, and is associated with severe 
asthma and sinusitis*. The species is only known to reproduce 
by asexual means’, but there has been accumulating evidence for 
recombination and gene flow from population genetic studies” *, 
genome analysis”’®, the presence of mating-type genes®*’® and 
expression of sex-related genes® in the fungus. Here we show that 
A. fumigatus possesses a fully functional sexual reproductive cycle 
that leads to the production of cleistothecia and ascospores, and 
the teleomorph Neosartorya fumigata is described. The species has 
a heterothallic breeding system; isolates of complementary mating 
types are required for sex to occur. We demonstrate increased 
genotypic variation resulting from recombination between mating 
type and DNA fingerprint markers in ascospore progeny from an 
Irish environmental subpopulation. The ability of A. fumigatus to 
engage in sexual reproduction is highly significant in understand- 
ing the biology and evolution of the species. The presence of a 
sexual cycle provides an invaluable tool for classical genetic ana- 
lyses and will facilitate research into the genetic basis of patho- 
genicity and fungicide resistance in A. fumigatus, with the aim of 
improving methods for the control of aspergillosis. These results 
also yield insights into the potential for sexual reproduction in 
other supposedly ‘asexual’ fungi. 

A. fumigatus is found worldwide in soils and on organic debris, 
where it is important in nutrient recycling’. The species has long 
been considered asexual, with dispersal achieved by the abundant 
production of haploid conidia’. Inhaled conidia are normally elimi- 
nated by the innate immune response. However, A. fumigatus has 
become the most prevalent airborne fungal pathogen as a result of its 
ability to cause infections in immunocompromised hosts, with a 
human mortality rate of at least 50% (refs 2, 3, 11). Airborne conidia 
are also allergens, associated with asthma, allergic sinusitis and 
bronchoalveolitis™*. 

The importance of A. fumigatus to human health has led to genome 
sequencing and investigations into the population biology of the 
species’*'’, Studies have revealed that, despite its supposedly asexual 
status, A. fumigatus has many characteristics of a sexual species. 
Genome screening detected 215 genes implicated in sexual develop- 
ment”, including a high-mobility group (HMG)-domain gene 
located at a ‘mating-type’ (MAT) locus typical of sexually reproducing 
heterothallic (obligately outcrossing) euascomycete fungi’. Such 
HMG genes are believed to be ancestral sex determinants in fungi™*. 
For sex to occur in heterothallic fungi, it is necessary for isolates of 
opposite MATI-1 and MAT1-2 mating types to be present’; by con- 
vention these contain either a MATI1-1 alpha-domain or MAT1-2 
HMG-domain gene, respectively, at the MAT locus'*. These MAT 


genes encode transcription factors that are master regulators of sexual 
reproduction, required for normal sexual development in Aspergillus 
and other species'*'®. A related study* identified isolates of A. fumi- 
gatus containing the complementary alpha-domain MAT gene, and 
showed expression of MATI-1 and MAT1-2 and of genes encoding 
sex pheromones and pheromone receptors. In addition, a nearly 1:1 
ratio of MATI-1:MATI1-2 isolates was found among a worldwide 
screening of 290 isolates, which is consistent with latent sexuality’. 
Gene recombination was also detected within a subset of isolates®, in 
agreement with other studies of A. fumigatus showing evidence of 
recombination and no detectable population structure, albeit with 
predominantly clonal reproduction®”'”'’”. The observed recombina- 
tion was attributed to past meiotic events, a so far undetected “cryptic 
sexual state or parasexual gene flow”®*. 

Previous population studies of A. fumigatus involved analysis of 
global collections or limited regional sample sets”*!”'’. In this study 
we screened for mating type® in an Irish environmental population of 
A. fumigatus, composed of 91 isolates collected from five locations in 
Dublin during 2005 in the course of aerial sampling work'® 
(Supplementary Table 1). Isolates were also fingerprinted by 
randomly amplified polymorphic DNA (RAPD)-PCR to detect 
clonality and assess genetic relatedness (see Methods). MATI-1 or 
MATI1-2 specific bands amplified from all except one isolate 
(Supplementary Fig. 1). Eighty-eight of the isolates had unique 
RAPD profiles, showing genetic variability within the Dublin popu- 
lation. A ratio of 49.4% MATI-1 to 50.6% MAT1-2 was evident in the 
clone-corrected data set, with no significant difference in mating- 
type distribution at any of the collection sites (P>0.05; 
Supplementary Table 2) and no evidence for clustering of isolates 
according to collection site or mating type (Fig. 1). Indeed, both 
mating types were found in samples collected within minutes of each 
other, showing that MATI-1 and MAT1-2 isolates are in close prox- 
imity in nature. Results are in agreement with previous studies that 
found MAT genotypes in nearly equal proportions*”’, although one 
survey reported a predominance of MAT1-2 (ref. 17). 

Species identity of the Dublin isolates was verified by molecular 
analysis, given possible misidentification of A. fumigatus sensu stricto 
by morphotyping’’”’. A subset of 12 representative isolates (6 MAT1-1, 
6 MATI1-2) was chosen (Supplementary Table 1), and regions of their 
B-tubulin (benA) and carboxypeptidase-5 (cxp) genes were sequenced 
(S. A. Balajee, D. Nickle, L. Razai, S. F. Hurst & K. A. Marr, personal 
communication; see Methods). All isolates had identical sequences to 
those of known A. fumigatus depositions (for example, GenBank 
AY685162 (benA) and DQ438742 (cxp)) except for Af—R931, which 
had only one nucleotide difference in the cxp amplicon. 

We then set up crosses with the 12 confirmed A. fumigatus sensu 
stricto isolates in all possible combinations of opposite mating types 
(n= 36) on a range of growth media at different temperatures to 
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Figure 1| Genetic diversity of 91 A. fumigatus isolates from Dublin, Ireland. 
Radial dendrogram derived from RAPD-PCR data pooled from primers 
UBC90, R151, R108 and RCO8. The tree was constructed by neighbour- 
joining analysis with branch lengths drawn to reflect genetic distance derived 
from Jaccard’s coefficient of RAPD band matching (scale bar: 0.1 = 10% 


determine whether it was possible to induce sexual reproduction in 
vitro (see Methods). After 6 months of incubation, mature cleistothecia 
(spherical fruiting bodies characteristic of sex in the aspergilli*') were 
found on pairings grown on Parafilm-sealed Oatmeal agar” plates at 
30 °C in the dark (Fig. 2a—d and Table 1). The light-yellow cleistothecia 
(typically 150-500 jtm in diameter) formed singly or in small clusters 
of two to five, mainly along the junction (barrage zone) where hyphae 
of the parental isolates came into contact, and to a smaller extent within 
mycelium on either side of the barrage zone. The related heterothallic 
species Neosartorya spathulata produces cleistothecia scattered or 
grouped in clusters”’, whereas N. fennelliae generally produces copious 
numbers of cleistothecia along lines of mycelial contact™*. There was 
marked variation in the numbers of cleistothecia produced between 
different pairings (Table 1). It is not yet known whether this variation is 
typical of the species. Cleistothecia failed to develop when other media 
and incubation temperatures were used, or in pairings between isolates 
of the same mating type. When cleistothecia were squashed, numerous 
yellowish-white to greenish-white lenticular ascospores (4-5 Jim in 
diameter) with two equatorial crests were observed, together with 
occasional intact asci (Fig. 2e-g). The ridged ornamentation of the 
ascospores is similar to that in the homothallic species N. assulata”. 
The teleomorph (sexual stage) of A. fumigatus was assigned to the 
genus Neosartorya on the basis of phylogenetic relatedness and mor- 
phology of the cleistothecia and ascospores'’*”®, and named 
Neosartorya fumigata O’Gorman, Fuller & Dyer sp. nov. 

Ascospores of N. fumigata germinated on 2% malt extract agar 
(MEA) at 28 °C within 15h and also germinated after exposure to 
70 °C for 90 min (Fig. 2h), heat resistance of ascospores being typical 
of Neosartorya species*’. Ascospore-derived cultures gave rise to 
characteristic A. fumigatus colonies bearing abundant asexual 
conidia under routine growth conditions. 

Finally, 15 ascospore progeny from each of three different crosses 
were assessed for meiotic recombination by examining the segregation 
of five genetic markers (four RAPD bands and the mating-type 
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genetic difference). Numbers at nodes indicate the percentage bootstrap 
support (based on 1,000 resamplings) for which the cluster is supported 
(only values of 60% or more are shown, for clarity). MAT1-1 and MAT1-2 
genotypes are shown in red and green, respectively. 


genotype) (Fig. 3). Between 53% and 67% of progeny showed unique 
genotypes, with only 7-13% of progeny identical to one of the 
parents (on the basis of the markers examined), providing clear 
evidence of recombination during the sexual cycle (Supplementary 
Tables 3-5). The segregation of mating type was consistent with a 
heterothallic breeding system. Fisher’s exact test confirmed that 
genetic markers were equally distributed between the progeny in each 
cross, verifying the null hypothesis of 1:1 Mendelian segregation as a 
result of independent assortment. 

A. fumigatus was first described 145 years ago”’, but its teleomorph 
has remained undiscovered until now. Indeed, previous efforts failed 
to induce sexual reproduction despite considerable effort” (G. S. May, 
N. Osherov and J. Kwon-Chung, personal communication). There are 
several possible explanations of why the sexual cycle has not been 
reported before. Environmental parameters rarely encountered in 
nature might be required to trigger sex’’, or cleistothecial production 
may be confined to substrata that have not been systematically moni- 
tored. Sexuality has now been demonstrated in the once presumed 
asexual pathogen Candida albicans, with mating suggested to occur 
only in particular host niches*®. Similarly, sexual reproduction has 
been detected in previously ‘asexual’ phytopathogenic Tapesia and 
Mycosphaerella species on certain host substrates’”**. Alternatively 
the species as a whole may be experiencing a ‘slow decline’ in fertility 
— possibly linked to variation in MAT gene expression*”’. Thus, the 
Dublin isolates might represent a rare subset of fertile isolates, 
although there is evidence that A. fumigatus comprises one global, 
recombining population”” (see Supplementary Discussion). 

The discovery of a sexual cycle in A. fumigatus provides insights 
into the biology and evolution of the species. It helps explain the 
presence of diverse genotypes despite predominantly clonal repro- 
duction®””’, conservation of sex-related genes’®”’, aspects of genome 
evolution” and defence against repetitive elements’. In addition, 
ascospores might aid survival in adverse environmental conditions. 
The discovery also has significant medical implications. Sexual 
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Figure 2 | Neosartorya fumigata sp. nov. a, Paired culture of 

AfRB2 < AfIR956 incubated for 6 months at 30 °C on Oatmeal agar medium 
in a 9-cm diameter Petri dish. b, Cleistothecia (arrows) along junctions of 
intersecting colonies of opposite mating type. Scale bar, 1 cm. ¢, Cleistothecia 
among chains of conidia. Scale bar, 400 jum. d, Scanning electron micrograph 
of a cleistothecium, showing the peridium of interwoven hyphae. Scale bar, 
100 um. e, f, Scanning electron micrograph of an eight-spored ascus (e) and 
ascospores (f). Scale bars, 2 jim. g, h, Ascospores ungerminated (g) and 
germinating after exposure to 70 °C for 60 min (h). Scale bars, 20 um. 


Table 1| Numbers of cleistothecia produced by A. fumigatus crosses 


Number of cleistothecia 


Cross MAT1-1 
AfIR909  AfIR931 AFfIR954 AFIR957 AFIR974 AfRB2 

MAT1-2 AflR901 t t+++4 + 

AfIR915 + + + 

AfIR928 > t + + t+ +4 > 

AfIR956 + + t + + 

AfIR964 t t t > 

AfRB3 t + + + b+++4 
Ratings indicate the mean number of cleistothecia produced from two replicate crosses on 
Oatmeal agar medium in 9-cm diameter Petri dishes after incubation in the dark for 6 months at 


30°C: —, none; +, 1-19; ++, 20-39; +++, 40-59; ++++4, 60-79; +++++, 80-100; >, 
more than 100 cleistothecia. 
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Figure 3 | Segregation patterns of molecular markers. Representative 
RAPD-PCR data from N. fumigata parental isolates (P1 and P2) and 15 
ascospore progeny from the cross AfIR974 x AfIR964 using primer OPW-10. 
MM, molecular mass marker; P2, AfI[R964; P1, AfIR974; lanes 1-15, progeny 
974-964-1, 974-964-3 to 974-964-16; C, water control. Coloured arrows 
indicate bands designated OPW-10-1 (purple), OPW-10-2 (yellow) and OPW- 
10-3 (blue) (Supplementary Table 3). Lane headings are shown in red and 
green to indicate MAT1-1 and MAT1-2 genotypes, respectively. kb, kilobase. 


reproduction can result in progeny with increased virulence or res- 
istance to antifungal agents, and can confound diagnostic tests based 
on the assumption of clonality*’’. However, the sexual cycle also 
offers a valuable tool with which to determine the genetic basis of 
traits of interest. Pathogenicity in A. fumigatus seems multifactorial 
in nature®*"', with genome analysis revealing potential targets’. 
Classical genetic analyses may help in elucidating the contribution 
of particular components, thereby guiding disease control strategies. 
Results are also of general significance, given that almost one-fifth 
of all fungi have no known sexual stage. This includes many 
Aspergillus, Penicillium, Coccidioides and Malassezia species of great 
economic and medical importance. Some of these species have appar- 
ently functional MAT and other sex-related genes'*”’ (Supplementary 
Discussion). Recombination in asexual populations is often attributed 
to parasexual reproduction. However, parasexuality is limited to iso- 
lates of the same vegetative compatibility grouping’. According to the 
A. fumigatus model, cryptic sexuality is a more likely source of recom- 
bination, and isolation of fresh cultures combined with concerted 
laboratory crossing efforts of compatible mating types may lead to a 
sexual revolution for many of these other supposed ‘asexuals’. 


Neosartorya fumigata O’Gorman, Fuller & Dyer sp. nov. 


Etymology. The name fumigata is derived from the Latin fumigare ‘to 
make smoke’, which refers to the clouds of spores produced by the 
fungus. 

Holotype. Dried oatmeal agar plate with teleomorph of paired Asper- 
gillus fumigatus colonies, AfRB2 (MATI-1) X AfIR956 (MAT1-2), 
both collected by C. O’Gorman from outdoor air at Belfield, Dublin, 
November and December 2005, respectively; deposited in the 
Herbarium of the Royal Botanic Gardens, Kew (K) (K(M)159484). 
Isotypes in K (K(M)159485) and the National Herbarium, National 
Botanic Gardens, Dublin (DBN), each with a slide preparation of 
squashed cleistothecia. 

Referred material. Paired cultures of A. fumigatus AfIR957 (MAT1-1) 
< AfIR928 (MATI-2) and of AfIR974 (MATI-1) X AfIR964 (MAT1-2) 
(see Supplementary Table 1 for isolate details). 

Latin diagnosis. Fungus heterothallicus. Cleistothecia superficialia, cincta 
gossypio mycelio, globosa, 150-480 (600) 1m in diametro, dispersa singula 
vel coniuncta, primo alba vel luteo-alba, deinde mutans ad dilutam luteam 
vel flavo-griseam. Peridium factum est ex permultis propaginibus anguste 
intertextis, et interdum amplificatis hyphis, in tempore cretum simule 
membranae. Asci varie subglobosi, octospori, maturi evanescentes. 
Ascosporae luteae albae ad pallidas virentes, unicellulares, late lenticulares, 
raro globosae, cum duabus cristis aequatorialibus—0O.3-1.0 um latis, sine 
cristis 4-5 um; superficies convexae cum costis reticulatis proiectis, reticu- 
lum variat, ac costae effusae usque ad cristas; levis inter cristas, aliquando 
lapilli in medio earum. Status anamorphus: Aspergillus fumigatus 
Fresenius. 

Diagnosis. Heterothallic fungus. Cleistothecia superficial, surrounded 
by cottony mycelium, globose, 150-480 (600) um in diameter, scat- 
tered or grouped in small clusters, at first white to yellowish white 
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(4A2) then turning light yellow (4A4) to greyish yellow (4B5). 
Peridium composed of several layers of tightly interwoven, occasion- 
ally flattened hyphae; membranous with age. Asci irregularly sub-- 
globose, eight-spored, evanescent at maturity. Ascospores yellowish 
white (1A2) to greenish white (28A2), one-celled, broadly lenticular, 
rarely spherical, with two equatorial crests—0.3—1.0 um wide, spore 
body 4—5 um; convex surface with prominent ridges forming a reticu- 
late ornamentation, degree of reticulation varies, ridges spreading to 
the equatorial crest; area between crests generally smooth, occasionally 
with a midline of small projections. Anamorph: Aspergillus fumigatus 
Fresenius. 


METHODS SUMMARY 


Aspergillus fumigatus isolates were crossed in pairwise MATI-1 and MAT1-2 
combinations in duplicate on Oatmeal agar medium” (Pinhead Oatmeal; 
Odlum Group) and were incubated inverted at 15, 30 or 45°C in the dark. 
Crosses were also established on 2% MEA (Oxoid), Czapek Dox agar and 
Aspergillus complete medium® at 30 °C in the dark. Spore suspensions of each 
isolate (5 X 10° conidia ml!) were prepared from seven-day-old cultures. Two 
1-l aliquots of each spore suspension were separately inoculated onto the agar 
surface about 4 cm apart and perpendicular to aliquots of conidia of the opposite 
mating type. This configuration created four interaction/barrage zones as col- 
onies grew (see Fig. 2a, b). Plates were sealed with one layer of Parafilm. Crosses 
were examined for cleistothecia periodically over 6 months with Olympus 
SZH10 Stereo and BX45 light microscopes. Cleistothecia were removed with a 
sterile needle tip and cleaned of adhering conidia by gentle rolling across a drop 
of water under sterile conditions. Individual cleistothecia were added to 100 pl of 
0.05% Tween 80 and ruptured by squashing with a needle tip. The resulting 
macerate (of peridium, asci, ascospores and contaminating conidia) was trans- 
ferred to 1.9 ml of 0.05% Tween 80 and vortex-mixed for 1 min to disrupt the 
asci. The suspension was heated at 70 °C for 60 min, which prevented the ger- 
mination of conidia. Aliquots (100 pl) of the suspension were spread plated on 
2% MEA (9-cm plates) and incubated overnight at 28°C. Single-ascospore 
cultures were established on 2% MEA by transferring individual germinating 
ascospores with a LaRue lens cutter attached to an Olympus BX45 microscope. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Maintenance and crossing of isolates. Single-spore cultures of each isolate were 
routinely maintained on MEA slopes at 4 °C in the dark. Crosses were set up on 
Oatmeal agar medium”, Czapek Dox agar*! and Aspergillus complete medium’. 
Light microscopy and scanning electron microscopy. For micro-morphological 
examination of N. fumigata sexual reproductive structures, mounts were made in 
clear lactophenol from cultures grown on Oatmeal agar medium” and colours 
were assigned using Methuen colour names and numbers”. Dimensions of cleis- 
tothecia and ascospores (n= 20) were measured with the software package 
Olympus DP-Soft, version 5.0, linked to an Olympus BX-45 light microscope 
and an Olympus C-5060 Wide Zoom digital camera. For scoring purposes, cleis- 
tothecia were defined as spherical white to yellow structures more than 150 um in 
diameter. Scanning electron microscopy was performed with a Jeol JSM-5410LV 
scanning electron microscope. For preparation for scanning electron microscopy, 
mature cleistothecia were transferred to aluminium stubs with the use of double- 
sided adhesive tape. A small drop of 0.05% Tween 80 was added and the cleis- 
tothecia were crushed. The suspensions were air-dried and sputter-coated with 
gold. 

DNA extraction. Cultures were grown in 2% liquid malt extract for 5 days at 
28 °C, washed with a 0.1M phosphate buffer (pH 7.0) and ground into a fine 
powder under liquid nitrogen. Genomic DNA was extracted with a DNeasy Plant 
Mini Kit (Qiagen) in accordance with the manufacturer’s instructions. 
Mating-type PCR assay. The mating-type genotype of each isolate was deter- 
mined with a multiplex-PCR assay as previously described*, with reaction 
volumes of 25 pl containing 50-100 ng of DNA. Segregation of mating-type 
genes in ascospore progeny has previously been used to infer the breeding sys- 
tems of other filamentous ascomycete species, especially lichen-forming 
fungi. 

Statistical analyses. The null hypothesis was for a 1:1 ratio of the two mating 
types, as is characteristic for a randomly mating population of haploid organ- 
isms*°. The hypothesis was tested with 7’ and contingency 7’ tests, with P< 0.05 
considered significant. Fisher’s exact test was used instead of the z* test when 
sample sizes were small, because it is more accurate when expected frequencies 
are less than five’®. 

RAPD-PCR protocol for clone-correction and genetic relatedness. RAPD- 
PCR was performed with cycling conditions described previously’. Four pri- 
mers were used to fingerprint the 91 A. fumigatus isolates: RCO8 (5’- 
GGATGTCGAA-3’)**, R108 = (5'-GTATTGCCCT-3’)”, = RI51_ —(5'- 
GCTGTAGTGT-3’)” and UBC90 (5’-GGGGGTTAGG-3’)*°. PCR amplifica- 
tions were performed in 25-ul reaction volumes containing 0.1-1.0ng of 
DNA. Each amplification reaction contained 2.5 pl of 10x DyNAzyme buffer, 
0.75U of DyNAzyme II Polymerase (2Upl~'; Finnzymes OY, Flowgen 
Laboratories), 0.15 ul of 25 mM dNTP mix (ABgene), 50 mol of a single ten- 
base-pair primer and ultra-pure water (Sigma). Amplicons were resolved in 
1.5% agarose gels (data not shown). Clear, reproducible RAPD-PCR amplicons 
from each primer were scored as 0 (amplicon absent) or 1 (amplicon present) in 
Microsoft Excel. The data from all primers were pooled for each isolate, and a 
pairwise similarity matrix was calculated with Jaccard’s coefficient’ with the 
program FREETREE” vy. 0.9.1.50. A bootstrapped dendrogram with 1,000 repli- 
cations was produced with NJ Plot. It is noted that the most divergent isolate 
detected by this analysis, AfIR917, was also the only isolate that failed to amplify a 
MAT product in the multiplex mating-type assay, suggesting sequence diver- 
gence in the MAT region and that this isolate was not A. fumigatus sensu stricto. 
RAPD-PCR for genotypic markers. The 12 parental isolates (indicated in 
Supplementary Table 1) were screened with 37 ten-base RAPD primers 
(Operon Biotechnologies Kits OP-A, -J, -W and -X) for the amplification of 
bands suitable for genotypic markers. Five primers (A-05, J-01, W-10, W-19 and 
X-05) that amplified clear, reproducible bands unique to one of the parental 
isolates were selected. RAPD-PCR reactions were performed as described above. 
Segregation of RAPD-PCR markers has previously been used to determine sex- 
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Gene sequencing. The genes encoding {-tubulin and carboxypeptidase-5 were 
amplified with the primer sets benA (forward, 5’-AATTGGTGCCGCT- 
TTICTGG-3’; reverse, 5’-AGTTGTCGGGACGGAATAG-3’) and cxp (forward, 
5'-GAACATTAGCCCCAGTTGAG-3’; reverse primer, 5'-CACTTCTTCTTG- 
CACGTAGTC-3’), respectively (S. A. Balajee, D. Nickle, L. Razai, S. F. Hurst 
& K. A. Marr, personal communication). The amplified DNA fragments were 
purified with a QIAquick PCR purification kit (Qiagen). DNA sequencing of the 
forward strand of each fragment was performed at the Biopolymer Synthesis and 
Analysis Unit, University of Nottingham. The resulting sequences were aligned 
in CLUSTALW* using the program BioEdit*”. 

Measurement of ascospore germination. Using a fine-tipped needle, an intact 
mature cleistothecium was removed from a 12-month-old culture of an N. 
fumigata AfIR956 X AfRB2 cross. An ascospore suspension was prepared and 
heat-treated. Ascospore concentration (about 4 X 10* spores ml” ') was deter- 
mined with a haemocytometer. Defined areas on 2% MEA plates were each 
spread inoculated with 50 pl of the heat-treated ascospore suspension. Plates 
were sealed with one layer of Parafilm and incubated at 28 °C in darkness over- 
night. Coverslips were then placed on the agar surfaces over the defined areas, 
and levels of ascospore germination were determined. The percentage germina- 
tion of heat-treated (70°C, 60min) ascospores was 50.5 + 3.4% (n= 600; 
mean + s.e.m.) after 15h at 28 °C. 
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Anticipatory haemodynamic signals in sensory cortex 
not predicted by local neuronal activity 


Yevgeniy B. Sirotin’ & Aniruddha Das’”**°® 


Haemodynamic signals underlying functional brain imaging (for 
example, functional magnetic resonance imaging (fMRI)) are 
assumed to reflect metabolic demand generated by local neuronal 
activity, with equal increases in haemodynamic signal implying 
equal increases in the underlying neuronal activity’°. Few studies 
have compared neuronal and haemodynamic signals in alert ani- 
mals’* to test for this assumed correspondence. Here we present 
evidence that brings this assumption into question. Using a dual- 
wavelength optical imaging technique’ that independently mea- 
sures cerebral blood volume and oxygenation, continuously, in alert 
behaving monkeys, we find two distinct components to the haemo- 
dynamic signal in the alert animals’ primary visual cortex (V1). One 
component is reliably predictable from neuronal responses gener- 
ated by visual input. The other component—of almost comparable 
strength—is a hitherto unknown signal that entrains to task struc- 
ture independently of visual input or of standard neural predictors 
of haemodynamics. This latter component shows predictive timing, 
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Figure 1| Periodic fixation tasks evoke stimulus-independent, trial-linked 
signals even in the dark. a, Normalized emission spectra of the two 
illumination sources (light-emitting diodes), superimposed on in vitro 
absorbance spectra for deoxy- and oxyhaemoglobin"® (units: 

10*cm™! M_’). b, Stim.: V1 ‘blood volume’ response to small, brief, visual 
stimulus presented during periodic fixation trials. ‘Blank’: signal in trial with 
no visual stimulus. ‘Stim.-Blank’: stimulus-specific response. c, Eye position 
and pupil diameter (percentage of mean), consecutive trials. (Vertical 
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with increases of cerebral blood volume in anticipation of trial 
onsets even in darkness. This trial-locked haemodynamic signal 
could be due to an accompanying V1 arterial pumping mechanism, 
closely matched in time, with peaks of arterial dilation entrained to 
predicted trial onsets. These findings (tested in two animals) chal- 
lenge the current understanding of the link between brain haemo- 
dynamics and local neuronal activity. They also suggest the 
existence of a novel preparatory mechanism in the brain that brings 
additional arterial blood to cortex in anticipation of expected tasks. 

We have developed a dual-wavelength optical imaging technique 
to (in effect) image cortical blood volume and oxygenation simulta- 
neously in alert behaving macaques. This technique involves switch- 
ing rapidly between two wavelengths: 530nm (green, equally 
absorbed in oxygenated and deoxygenated haemoglobin, thus mea- 
suring total haemoglobin concentration (HbT) or ‘blood volume’) 
and 605 nm (red, absorbed about fivefold more strongly in deoxyge- 
nated than oxygenated haemoglobin, thus measuring ‘oxygenation’”” 


Trial-related signal 
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dashed lines: trial onsets. Note pupil dilation, fix break, fix acquire, shown 
for first trial.) Scales are colour-coded. d, Cortical signals, colour-coded by 
imaging wavelength. e, f, Trial-triggered averages (grey lines, individual 
trials; thick lines, mean + s.e.m., n = 51: at 605 nm, mean peak-to-peak 
amplitude = 1.19% = 0.08; at 530 nm, 3.47% + 0.21). Population 
histograms: at 605 nm, mean = 0.86%, s.d. = 0.29, N = 47 experiments; at 
530 nm, mean = 2.17%, s.d. = 0.97, N = 66. 
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(Fig. la and Supplementary Methods)). While imaging V1 in animals 
performing periodic visual tasks, we observed a hitherto unknown 
stimulus-independent haemodynamic signal that appeared to 
entrain to trial timing (Fig. 1b). 

To study this trial-related signal in isolation, we developed a task 
that minimized visual input while preserving trial timing. In an 
otherwise completely dark room, the animal was required to fix its 
gaze periodically on a tiny fixation point for juice reward (point size 
about 1—2 arcmin, that is, about 1-2 cone diameters). The fixation 
point stayed on continuously, switching between two equiluminant 
colours to cue the animal to ‘fixate’ or ‘relax’. It was akin to seeing 
nothing apart from one single twinkling star in an otherwise black 
night sky. Our two rhesus macaque monkeys (“V’ and ‘S’) learned the 
task correctly, as evidenced by their fixation patterns (Fig. 1c). Both 
monkeys performed long sequences of correct trials, consistently 
holding fixation during ‘fixate’ periods and taking fixation breaks, 
if any, only during ‘relax’ periods. 

On imaging V1 while the animals performed this task, we observed 
robust haemodynamic signals at the trial frequency, even though the 
animals were in virtually total darkness and foveal V1, the only region 
receiving visual input from the fixation point, lay outside our 
imaging area. These periodic fluctuations were seen in both the 
‘blood volume’ (530nm) and ‘oxygenation’ signals (605 nm; 
Fig. 1d, f). They were accompanied by periodic changes in heart rate" 
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Figure 2 | Local neuronal activity predicts visually driven, but not trial- 
related, haemodynamics. a, Trial-triggered mean haemodynamic (‘blood 
volume’) and electrophysiological signals comparing stimulus-driven and 
dark-room responses, representative experiment. LFP power spectrum 
(bottom) normalized to pre-stimulus dark power (2-Hz resolution). 

b, Comparing measured haemodynamics (green) with optimal predictions 
from concurrent spiking, same experiment. Blue, grey: using kernels (inset) 
obtained by fitting stimulated or dark-room signals respectively. (The same 
colour code is used throughout. Dark-room kernel and prediction almost 
indistinguishable from a flat line; prediction shown for dark only, to avoid 


476 


NATURE|Vol 457|22 January 2009 


and systematic pupil dilation’ on trial onset, suggesting a rhythmic 
state of alertness synchronized to each trial (Fig. 1c—f). 

We wanted to determine the relation between these trial-linked 
haemodynamic signals and V1 neuronal activity. A crucial assump- 
tion in most brain imaging studies is that haemodynamic signals are 
caused by local neuronal responses through a uniform underlying 
mechanism'® (but see refs 13, 14). In particular, brain images are 
routinely used to infer changes in local neuronal activity by fitting the 
imaging signal with some standard causal haemodynamic kernel. To 
reveal neuronal mechanisms underlying V1 haemodynamics, we 
obtained both trial-related and visually evoked optical imaging 
signals concurrently with electrode recordings across V1 (Supple- 
mentary Fig. 1 and Supplementary Table 1). At each site, in alter- 
nating blocks (20-40 trials each) while the animal performed the same 
fixation task, we presented either vigorous visual stimuli or no stimuli 
at all. For each data set we then used an optimization routine to 
calculate the causal kernel that ‘best’ fitted haemodynamics to con- 
current neuronal signals (Supplementary Fig. 2), and tested whether 
this ‘best’ kernel could reliably predict haemodynamics. 

To get measures of neuronal activity for this analysis, we separated 
the electrode recordings into multi-unit spiking (MUA) and local field 
potential (LFP) (Supplementary Fig. 1 and Supplementary Methods). 
As expected, visual stimulation evoked vigorous responses in both 
MUA and LFP (Fig. 2a). The stimulus-evoked LFP responses could 
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clutter.) Black arrows: trial-related activity not predicted in either the 
stimulated or dark-room trials. Blue arrows: random bursts of neuronal 
activity generate matching deflections in the predicted and observed trace. 
Right: scatter-plots and R’ values of observed versus predicted haemodynamics 
using stimulus-based predictors. ¢, Optimal kernels across days (amplitude 
normalized for comparison; N = 28 recording sites). d, e, Descriptive statistics 
of spike-based fits. Top: stimulus-based prediction; bottom: dark-room based. 
d, Ratio of variance between predicted and measured signals (ap/ Ou). e, R’ 
statistic: open versus closed bars represent dark-room versus stimulus-driven 
sessions, respectively. Arrows mark population means. 
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be empirically separated into two distinct frequency bands (Fig. 2a, 
bottom). The high-frequency band (hi-LFP: 66-130 Hz, avoiding 
60 Hz), like MUA, showed crisp, visually evoked responses. The 
low-frequency band (lo-LFP: 10-56 Hz), also showed a robust signal 
but with no apparent correlation with visual stimulation. Our empiri- 
cally defined LFP bands match categories defined through previous 
work. The hi-LFP matches a frequency band (‘high gamma’) that is 
shown to correlate well with stimulus-evoked spiking and haemo- 
dynamics”'*'*®. The lo-LFP—often separated into finer frequency 
bands'>'°—is believed to have a very different relationship with other 
brain signals'>'®. We therefore decided to test the three neuronal 
signal types, MUA, hi-LFP and lo-LFP, independently for their ability 
to predict concurrently recorded haemodynamics reliably. These tests 
were conducted separately for ‘blood volume’ and ‘oxygenation’ 
signals. 

Visually driven MUA and hi-LFP predicted the simultaneously 
recorded haemodynamic signals very well both in amplitude and 
time course (Fig. 2b-e and Supplementary Fig. 3b-e). Further, the 
optimal kernels obtained by fitting these signals were consistent in 
shape across all recording sites (Fig. 2c and Supplementary Fig. 3c, 
top); kernels from any given experiment predicted visually evoked 
responses in all other experiments with almost equal accuracy, attest- 
ing to their remarkable reliability (Supplementary Fig. 4). In sharp 
contrast, the same kernels, when convolved with dark-room MUA or 
hi-LFP, were uniformly poor at predicting trial-related haemo- 
dynamics, in both amplitude and temporal correlation (Fig. 2b—e 
and Supplementary Fig. 3b—e). The latter finding (R” ~ 0.08, MUA; 
0.06, hi-LFP) specifically implies that there is no consistent temporal 
relation between predicted and measured haemodynamics, inde- 
pendent of amplitude. This poor predictability was particularly strik- 
ing because the trial-related haemodynamic signal amplitudes were 
almost comparable to those of responses to vigorous visual stimu- 
lation (37% at ‘blood volume’, 530 nm; 57% at ‘oxygenation’, 605 nm 
(Supplementary Fig. 5)). To check whether trial-related haemo- 
dynamics could still be predicted reliably by concurrent neuronal 
recordings but through kernels of a different shape, we fitted dark- 
room MUA and hi-LFP to dark-room haemodynamics. These ‘best’ 
dark-room kernels were highly variable among recording sites and, 
again, consistently failed to predict trial-related haemodynamics 
(Fig. 2c-e and Supplementary Fig. 3c—e, bottom). The same overall 
pattern of results was seen for both ‘blood volume’ and ‘oxygenation’ 
signals (Supplementary Fig. 6). 

These results provide compelling evidence that visually evoked 
haemodynamic signals are very well predicted by established mea- 
sures of local neuronal activity (MUA, hi-LFP) through a causal 
kernel that is uniform across experiments. Such a model fails pro- 
foundly, however, to predict the trial-related signals. Therefore any 
neuronal mechanisms underlying trial-related haemodynamics 
appear to be distinct from those typically assumed to underlie neu- 
rovascular coupling. 

Unlike MUA or hi-LFP, lo-LFP—whether treated as a whole or 
separated into finer frequency bands—failed to show any consistent 
relationship with haemodynamics. These signals gave highly variable 
“optimal kernels’ when fitted with concurrent haemodynamics either 
under visual driving or in the dark, with uniformly poor predictions 
of haemodynamics (Supplementary Figs 7 and 13). 

Next, we characterized the novel trial-related haemodynamic sig- 
nal in terms of its temporal relation to trial timing. To determine 
whether our observed signals are linked specifically with trial timing 
and not a result of some unrelated intrinsic oscillatory process'’, we 
examined how the signals adapted to different trial periods. Our 
results provided compelling evidence that the signals are linked pre- 
dictively to trial onsets. This was seen both in the signal shapes at each 
trial period and their anticipatory timing on switching trial period. 

We found that the trial-related signals stretched elastically to match 
each tested trial period (Fig. 3 and Supplementary Fig. 8a; tested 6-s to 
30-s trial periods). In particular, the shape of the ‘blood volume’ signal 
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Figure 3 | Trial-related haemodynamic signals entrain to anticipated trial 
onsets, stretching to conform to the trial period. a, Dark-room fixation 
trials, each with the same 4-s ‘fixate’ epochs but in blocks of different trial 
periods, short (8 s, blue) and long (20s; pink) (same colour scheme in other 
panels and Fig. 4). b, c, ‘Blood volume’ signal, short and long trials 
respectively. Averages triggered on trial onsets, + s.e.m. (b: N = 220 trials; 
c: N= 179). Arrowheads: peak brightening (red), darkening (black). Dotted 
white line in ¢ shows where short-trial onset would have occurred. 

d, Population histograms of peak brightening and darkening in units of trial 
phase (0, trial start; 1, trial end, start of next trial; brightening, mean = 0.43, 
s.d. = 0.16; darkening, mean = 0.89, sd = 0.21; N = 66; trial periods ranged 
from 6 to 30s). e, f, Signal at transitions between trial periods: short-to-long 
(e, N= 16 trials, mean = s.e.m.), and long-to-short (f, N = 10); arrowheads 
are aligned, in each case, to the panel above for comparison of signal features. 
Dotted white line as in c. g, Left: short-to-long transition trial (green) is 
statistically indistinguishable from other short trials over one short-trial 
period (blue background). (Bootstrap analysis. Green: mean transition trial; 
grey: means of 500 random N = 16-trial subsets of the short trials to match 
statistics of transition trial; black: grand mean of all short trials, same as 

b; inset histogram: correlation of random subsets with grand mean; 
arrowhead: correlation of transition trial with grand mean = 0.97.) Right: 
short-to-long transition response is distinct from random N = 16-trial 
subsets of long trials. Same conventions as on the left, with the correlation 
coefficients being calculated, again, over the duration of one short period 
(pink background). 


always stretched so as to start darkening (increasing haemoglobin) 
during the ‘relax’ period—before the onset of the next trial—reaching 
a peak darkening close to the onset of the next ‘fixate’ period (Figs 1 
and 3a—d). This elastic pattern of trial-locked haemodynamics—in 
which signals begin changing before trial onsets—cannot be explained 
by mechanisms that involve a causal kernel triggered on trial start. 
This can be demonstrated by comparison with responses to (brief, 
intense) visual stimulation of the same duration as the ‘fixate’ period, 
where the stereotyped response shape, with abrupt onset and fixed 
width after stimulus presentation, is independent of trial period 
(Supplementary Fig. 8b; quantitative model, Supplementary Fig. 8c). 
The trial-related signal is thus unlikely to be due to neuronal signals 
active only during the cued ‘fixate’ period (for example, the presumed 
time course of ‘attention’'’). 

On switching trial timing unexpectedly after the monkey had 
established a rhythm of 10-20 correct trials at a given period, hae- 
modynamic signals continued to oscillate at the earlier period for a 
couple of trials before entraining to the new one (Fig. 3e, f). This 
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Figure 4 | Mean ‘blood volume’ signal is closely matched, temporally, by V1 
arterial contraction-dilation cycle. a, Mean trial-triggered signals and 

b individual frames showing fractional signal change relative to trial-mean 
image. Inset square: green, ‘fixate’; black, ‘relax’. Magnified sections show 
arterial contraction (white walls) and dilation (black walls). Grey trace in 
a: arterial signal relative to ‘parenchyma baseline’ (Supplementary Fig. 11, 
method for calculating arterial signal; arterial trace shifted vertically from 


occurred even though the animal itself picked up the new trial pace 
immediately, holding and breaking fixation at the new rhythm right 
after the switch (that is, clearly having noticed the new pace of fixa- 
tion cues). Thus, on switching from short to long trials the measured 
signals showed a peak darkening at the short-trial spacing even 
though the animal was fixating correctly at the longer period 
(Fig. 3e). Similarly, on switching from long to short trials the cortical 
signal continued at its previous slower pace for one long period, 
overriding the first few short trials (Fig. 3f). The response shape 
observed on transition trials closely resembled pre-transition 
responses for the duration of the pre-transition trial period, while 
being very poorly matched to the post-transition trial shape, suggest- 
ing that the underlying neuronal mechanism continued to ‘antici- 
pate’ the pre-transition trial timing (Fig. 3g and Supplementary 
Fig. 9). Further, the trial-related signal timing was correlated specifi- 
cally with trial onsets and not with reward”: the peak darkening 
position remained unaffected on delaying the reward associated with 
each trial (Supplementary Fig. 8d). 

Finally, images of the cortical surface suggested that the trial- 
related signals involve the local vasculature rather than being a sys- 
temic trial-locked autonomic (for example, cardiac) response”. 
These images revealed a dramatic contraction—dilation cycle in V1 
arteries, evidenced by a prominent brightening, followed by darken- 
ing of the arterial walls relative to the ‘parenchyma’ baseline (Fig. 4a, 
b and Supplementary Fig. 10; Supplementary Fig. 11 indicates how 
arteries, veins and ‘parenchyma’ are distinguished and how the arte- 
rial signal is measured). This arterial signal had a timing that closely 
matched the overall timing of the mean ‘blood volume’, with peaks of 
arterial contraction and dilation coinciding with peaks of brightening 
(decreased haemoglobin) and darkening (increased haemoglobin), 
respectively (Fig. 4a, c-g). The arterial cycle stretched elastically to fit 
trial periods, matching the shape of the mean signal (Fig. 4d, e). 
Further, on switching trial periods, the arterial cycle showed an anti- 
cipatory dilation that was well synchronized with the anticipatory 
increase in ‘blood volume’ seen in the mean signal (Fig. 4f, g). This 
local arterial cycle may thus be the specific mechanism generating 
trial-related increases in V1 ‘blood volume’ in anticipation of visual 
tasks. Further, the arterial cycle is seen in V1 only for visual tasks and 
is likely not a passive consequence of trial-locked changes in heart 
rate or blood pressure”’. We found no V1 arterial pumping or trial- 
related changes in V1 ‘blood volume’ in a periodic auditory control 
task, despite the presence of periodic changes in heart rate and pupil 
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dilation very similar to those seen in our visual task (Supplementary 
Fig. 12). 

Our findings have two major implications: one for the interpretation 
of brain imaging’, the other advancing our knowledge of brain 
mechanisms underlying anticipation. First, the interpretation of 
fMRI’, for example, through general linear modelling”, typically makes 
the crucial assumption of a uniform linear predictive relationship 
between neuronal and haemodynamic signals. We show that this 
model is valid for visually evoked signals, but that it fails profoundly 
to predict another class of signals, of almost comparable magnitude and 
behaviourally linked structure. These results raise the further possibility 
that there may be other, hitherto uncovered exceptions’*"* to the 
assumption that haemodynamic signals uniformly imply equivalent 
underlying neuronal activity. Second, the predictive timing and arterial 
contraction—dilation cycle that we observe in the trial-related haemo- 
dynamic signal suggests that it could reflect a novel anticipatory brain 
mechanism. This mechanism could play the role of preparing cortex for 
anticipated tasks by bringing additional arterial blood in time for task 
onsets. The question of the mechanism driving this signal (for example, 
distal neuromodulatory control of cerebral arteries?) and its functional 
consequences remains a challenge for future investigations. 


METHODS SUMMARY 


Results were obtained using continuous, dual-wavelength intrinsic-signal 
optical imaging and electrode recording in two monkeys engaged in either visual 
fixation tasks or auditory control tasks. Standard alert-monkey optical imaging 
techniques** were used to record the intrinsic cortical signal, continuously, 
through a clear silicone artificial dura and glass-fronted recording chamber 
implanted over the V1 of the animals. The primary innovation here consisted 
of our using two imaging wavelengths. Two arrays of fast, high-intensity light- 
emitting diodes at the two wavelengths (530 nm, 605 nm) were switched on and 
off alternately in synchrony with the camera, thus illuminating the brain surface 
alternately with each wavelength on successive camera frames (15 frames per 
second). The illumination alternated much faster than typical haemodynamic 
signal timescales giving, in effect, simultaneous optical imaging at both wave- 
lengths at 7.5 frames per second. Increased absorption (darkening) at 530nm 
indicated an increase in total haemoglobin (‘blood volume’). Increased absorp- 
tion at 605 nm primarily indicated an increase in deoxyhaemoglobin, from a 
combination of increased deoxygenation and blood volume. 

For the dark-room fixation task, in a completely dark room, with a mask 
covering even the stimulus presentation monitor, the animal was cued to fixate 
or relax by the colour of a fixation point visible through a pinhole in the mask 
(size 1-2 arcmin), typically switching between equiluminant green (‘fixate’) and 
red (‘relax’). We dark-adapted alongside the animal to confirm that nothing else 
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was visible. Control experiments confirmed that the trial-related signal was 
independent of the brightness (range: 10), colour and size (range: 25 X in area) 
of the fixation point. 

All experimental procedures were performed in accordance with the National 
Institutes of Health Guide for the Care and Use of Laboratory Animals and were 
approved by the Institutional Animal Care and Use Committees of Columbia 
University and the New York State Psychiatric Institute. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Tasks: visual fixation. Two monkeys were trained on a variety of visual tasks 
with a common periodic fixation schedule cued by fixation point colour. The 
tasks required only passive fixation during ‘fixation on’ for juice reward (fixation 
window: 0.5° radius; monitor distance: 133 cm; fix duration: 4s within trials of 
duration ranging from 6 to 30s; trial duration typically held fixed for a given 
experiment; on some experiments, trial timing switched in blocks between two 
or three specific values; in other, control experiments, it was randomized by 
drawing numbers from a homogenous set). Eye fixation and pupil diameter 
recorded using an infrared eye tracker”. 

Tasks: auditory control. See Supplementary Fig. 10, auditory pitch discrimina- 
tion task, in a completely dark room (lacking even the fixation point). The trial 
sequence was as follows: animal pulls lever (start trial) — fixed delay (range 
4-10 s in different experiments) — auditory tone onset — delay (typically 4s) > 
tone changes pitch, cue to release lever as quickly as possible (200 ms), for juice 
reward. Once the animals learned this task they performed trials in rapid suc- 
cession. Thus we could determine task periodicity by setting the initial delay 
from lever pull to tone on. The monkey typically continued looking in the 
general direction of the fixation point (even though none was present), allowing 
us to track pupil dilation with the infrared camera. 

Optical imaging: surgery, recording chambers, artificial dura. After the 
monkeys were trained on visual fixation tasks, craniotomies were performed 
over the animals’ V1, and glass-windowed stainless steel recording chambers 
were implanted, under surgical anaesthesia, using standard sterile proce- 
dures**”®, so as to image an area of V1 of about 10 mm covering a visual eccent- 
ricity of about 1-5°. The exposed dura was resected and replaced with a soft, clear 
silicone artificial dura. After the animals had recovered from the surgery, cortical 
activity from their V1 was optically imaged, routinely, while they engaged in 
relevant behavioural tasks. Recording chambers and artificial dura were fabri- 
cated in our laboratory using published methods”. 

Optical imaging: hardware. The camera was a Dalsa 1M30P (binned to 256 
pixels X 256 pixels, 15 frames per second). The frame grabber was an Optical PCI 
Bus Digital (Coreco Imaging). The software was developed in our laboratory 
based on a system by V. Kalatsky**. The illumination was high-intensity light- 
emitting diodes (Agilent Technologies, Purdy Technologies), with emission 
wavelengths centred at 530 and 605 nm, filtered through small individual inter- 
ference filters (Omega Optical). The lens was a ‘macroscope’ of back-to-back 
camera lenses” focused on the cortical surface. Image acquisition was continu- 
ous, simultaneously recording signals from camera, trial timing and behavioural 
data (trial onset, stimulus onset, identity and duration, etc., eye position, pupil 
size, timing of fixation breaks, fixation acquisitions, trial outcomes). Data were 
analysed offline using custom software (MATLAB). 

Optical imaging: image processing. All images were first corrected for residual 
brain movements by aligning each frame to the first frame (shift + rotation’’), 
using blood vessels. Signal means (for example, in Fig. 1b) were obtained by 
averaging signals over the full area, then dividing by the trial-mean of this average 
to give the percentage signal change as a function of time in a trial. Images of 
cortical signal (Figs la and 4b and Supplementary Movie) were obtained by 
aligning image sequences to a selected time point (for example, trial onset) 
and averaging, frame by frame, across the set of all correct trials. This gave 
‘movies’ of cortical activity at the camera frame rate (7.5 frames per second at 
each wavelength). For images of stimulus-evoked responses (Fig. 1a) (stimulus: 
0.25° bar, flashed on for 1s at the start of each fixation trial), each frame in the 
movie was then divided, pixel by pixel, by the mean pre-stimulus image (five 
frames preceding time = 0 ms). The ‘Blank’ response in Fig. 1b was obtained the 
same way, at the same time point (3.3 s after stimulus onset), but on a fixation 
trial with no stimulus. For imaging the trial-related signal (Fig. 4b and 
Supplementary Movie), each frame in the movie was divided, pixel by pixel, 
by the trial mean (average of all images over one trial duration), rather than pre- 
stimulus image, to give the image of fractional signal change relative to the trial 
mean (Fig. 4b and Supplementary 9). To get time courses of blood-vessel signal 
relative to the mean (Fig. 4c), the signal was measured along test lines, sampling 
veins, arteries and ‘parenchyma’, and the ‘parenchyma baseline’ regressed away 
(Supplementary Fig. 9). To get movies of stimulus-evoked activity, similar 
movies were obtained of cortical activity aligned to stimulus onset, both for 
trials with stimulus present (‘stimulated’) and absent (‘blank’). Movie frames 
were divided by the pre-stimulus baseline of three to five frames immediately 
preceding stimulus onset to get the overall change in cortical activity after 
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stimulus (Supplementary Fig. 1b). We only included trials where the animal 
maintained fixation correctly. 

Visual stimuli for comparing stimulated versus dark-room responses. 
Gratings were optimized to stimulate the recorded location, typically 100% 
contrast, 4c/deg, drifting at 4° per second. 

Electrophysiology: hardware, electronics and analysis. Extracellular electrode 
recordings (plastic-coated tungsten: FHC or tungsten in glass; impedances 300- 
800 kQ; Plexon amplifier and recording software) were conducted simultaneously 
with optical imaging (Supplementary Fig. 1). Penetrations were distributed over 
imaged V1. Recording sites sampled cortical depths starting from most superficial 
to about 1500 tm below the pial surface at steps of 200-400 jum (Supplementary 
Table 1). The electrode signal was split into ‘spiking’ (100 Hz to 8 KHz bandpass) 
and LFP (0.7-170 Hz). MUA events were defined as each negative-going crossing 
of a threshold equal to about four times the root mean square of the baseline 
obtained while the animal looked at a grey screen (Supplementary Fig. 1). 
Electrophysiology: data processing. MUA were binned into 16.67-ms bins and 
aligned to the haemodynamic traces using simultaneously recorded synch events. 
LFP data were spectrally decomposed using mtspecgramc (Chronux Toolbox for 
MATLAB; sliding window of | s, a step size of 250 ms, frequency range 10-130 Hz) 
and interpolated into a continuous power spectrum aligned to the haemodynamic 
traces. Two-dimensional spectrograms (Fig. 2a bottom) show the trial-triggered 
mean LFP power normalized by the mean pre-trial power in the dark signal (2 Hz 
frequency resolution). The LFP time course (Supplementary Figs 4 and 6) shows 
the bandpass-filtered power, integrated over each relevant frequency band (‘low- 
frequency’ 10-56 Hz, or ‘high-frequency’ 66-130 Hz, avoiding 60 Hz). 
Electrophysiology: fitting to haemodynamics. For each electrophysiological 
measure (MUA, low-frequency LFP, high-frequency LFP) the ‘best’ kernel pre- 
dicting haemodynamics from concurrent electrophysiology was calculated 
(Supplementary Fig. 3). Correct trials were extracted from the continuous time 
series and concatenated into a continuous series. We modelled the haemo- 
dynamic response function (HRF) as a gamma kernel: HRF(t, T, W, A) = AX (t/ 
T)” X exp[(t— T)/—f], where «=(T/W)? X 8.0 X log(2.0), B= W/T/8.0/ 
log(2.0), A is the amplitude, T is the time to peak and W is the full width at 
75% maximum. We fit the kernel parameters using a downhill simplex algorithm 
(fminsearch, MATLAB) by comparing the actual haemodynamic response 
obtained during stimulated trials to that predicted from a convolution of the 
HRE with the corresponding spike or gamma-band (66-130 Hz) LFP trace. The 
algorithm reliably converged to similar temporal HRF parameters across all days 
(T= 2.50 (0.08)s, W= 1.68 (0.06)s). The proportion of the variance in the 
haemodynamic responses explained by neuronal activity was quantified using 
the R° statistic from linear regression of the predicted haemodynamic trace to 
the observed trace both for the stimulated and the dark-room trials. 

Controls for trial timing. We performed control experiments to verify that the 
observed signals were tied specifically to task-related trial onsets, independent of 
other timing signals. We confirmed that the signal periodicity was not linked to 
the animal either acquiring or breaking fixation: the two time points, during each 
trial, with any change in light on the retina (albeit minuscule). This also ruled out 
any links to extra-retinal fixation-related V1 activity. We controlled for the 
rhythmic pupil dilations, that is, for the possibility that cortical signals were 
being evoked by the accompanying pulse of extra light. Giving the animal simu- 
lated pupil dilations—a bright flash in the fixation point—evoked no cortical 
response. As a control for the possibility that the signal was an accidental match 
to ongoing oscillations, we introduced 20% jitter in trial timing; the signal 
specifically entrained to trial onsets. 
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A core gut microbiome in obese and lean twins 
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Andrew C. Heath’, Rob Knight* & Jeffrey |. Gordon’ 


The human distal gut harbours a vast ensemble of microbes (the 
microbiota) that provide important metabolic capabilities, includ- 
ing the ability to extract energy from otherwise indigestible dietary 
polysaccharides’ *. Studies of a few unrelated, healthy adults have 
revealed substantial diversity in their gut communities, as mea- 
sured by sequencing 16S rRNA genes®**, yet how this diversity 
relates to function and to the rest of the genes in the collective 
genomes of the microbiota (the gut microbiome) remains obscure. 
Studies of lean and obese mice suggest that the gut microbiota 
affects energy balance by influencing the efficiency of calorie har- 
vest from the diet, and how this harvested energy is used and 
stored**. Here we characterize the faecal microbial communities 
of adult female monozygotic and dizygotic twin pairs concordant 
for leanness or obesity, and their mothers, to address how host 
genotype, environmental exposure and host adiposity influence 
the gut microbiome. Analysis of 154 individuals yielded 9,920 near 
full-length and 1,937,461 partial bacterial 16S rRNA sequences, 
plus 2.14 gigabases from their microbiomes. The results reveal that 
the human gut microbiome is shared among family members, but 
that each person’s gut microbial community varies in the specific 
bacterial lineages present, with a comparable degree of co-variation 
between adult monozygotic and dizygotic twin pairs. However, 
there was a wide array of shared microbial genes among sampled 
individuals, comprising an extensive, identifiable ‘core micro- 
biome’ at the gene, rather than at the organismal lineage, level. 
Obesity is associated with phylum-level changes in the microbiota, 
reduced bacterial diversity and altered representation of bacterial 
genes and metabolic pathways. These results demonstrate that a 
diversity of organismal assemblages can nonetheless yield a core 
microbiome at a functional level, and that deviations from this core 
are associated with different physiological states (obese compared 
with lean). 

We characterized gut microbial communities in 31 monozygotic 
twin pairs, 23 dizygotic twin pairs and, where available, their mothers 
(n= 46) (Supplementary Tables 1-5). Monozygotic and dizygotic 
co-twins and parent-offspring pairs provided an attractive model 
for assessing the impact of genotype and shared early environmental 
exposures on the gut microbiome. Moreover, genetically ‘identical’? 
monozygotic twin pairs gain weight in response to overfeeding in a 
more reproducible way than unrelated individuals'® and are more 
concordant for body mass index (BMI) than dizygotic twin pairs!’. 

Twin pairs who had been enrolled in the Missouri Adolescent 
Female Twin Study (MOAFTS”’) were recruited for this study (mean 
period of enrolment in MOAFTS, 11.7 + 1.2 years; range, 4.4—13.0 
years). Twins were 21-32 years old, of European or African ancestry, 
and were generally concordant for obesity (BMI>30kgm’) or 


leanness (BMI = 18.5-24.9 kg m ”) (one twin pair was lean/over- 
weight (overweight defined as BMI = 25 and < 30) and six pairs were 
overweight/obese). They had not taken antibiotics for at least 
5.49 + 0.09 months. Each participant completed a detailed medical, 
lifestyle and dietary questionnaire: study enrolees were broadly 
representative of the overall Missouri population for BMI, parity, 
education and marital status (see Supplementary Results). 
Although all were born in Missouri, they currently live throughout 
the USA: 29% live in the same house, but some live more than 800 km 
apart. Because faecal samples are readily attainable and representative 
of interpersonal differences in gut microbial ecology’, they were col- 
lected from each individual and frozen immediately. The collection 
procedure was repeated again with an average interval between 
sampling of 57 + 4 days. 

To characterize the bacterial lineages present in the faecal micro- 
biotas of these 154 individuals, we performed 16S rRNA sequencing, 
targeting the full-length gene with an ABI 3730xl capillary sequencer. 
Additionally, we performed multiplex pyrosequencing with a 454 
FLX instrument to survey the gene’s V2 variable region'’ and its 
V6 hypervariable region (Supplementary Tables 1-3). 

Complementary phylogenetic and taxon-based methods were 
used to compare 16S rRNA sequences among faecal communities 
(see Methods). No matter which region of the gene was examined, 
individuals from the same family (a twin and her co-twin, or twins 
and their mother) had a more similar bacterial community structure 
than unrelated individuals (Fig. la and Supplementary Fig. la, b), 
and shared significantly more species-level phylotypes (16S rRNA 
sequences with =97% identity comprise each phylotype) 
(G=55.2, P<10" (V2); G=12.3, P<0.001 (V6); G=11.3, 
P<0.001 (full-length) ). No significant correlation was seen between 
the degree of physical separation of family members’ current homes 
and the degree of similarity between their microbial communities 
(defined by UniFrac'’). The observed familial similarity was not due 
to an indirect effect of the physiological states of obesity versus lean- 
ness; similar results were observed after stratifying twin pairs and 
their mothers by BMI category (concordant lean or concordant obese 
individuals; Supplementary Fig. 2). Surprisingly, there was no sig- 
nificant difference in the degree of similarity in the gut microbiotas of 
adult monozygotic compared with dizygotic twin pairs (Fig. 1a). 
However, we could not assess whether monozygotic and dizygotic 
twin pairs had different degrees of similarities at earlier stages of their 
lives. 

Multiplex pyrosequencing of V2 and V6 amplicons allowed higher 
levels of coverage compared with what was feasible using Sanger 
sequencing, reaching on average 3,9844232 (V2) and 
24,786 + 1,403 (V6) sequences per sample. To control for differences 
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Figure 1| 16S rRNA gene surveys reveal familial similarity and reduced 
diversity of the gut microbiota in obese individuals. a, Average unweighted 
UniFrac distance (a measure of differences in bacterial community 
structure) between individuals over time (self), twin pairs, twins and their 
mother, and unrelated individuals (1,000 sequences per V2 data set; 
Student’s t-test with Monte Carlo; *P < 10°; **P<10 4; ***P<10 4); 
mean ~ s.e.m.). b, Phylogenetic diversity curves for the microbiota of lean 
and obese individuals (based on 1—10,000 sequences per V6 data set; 
mean + 95% confidence intervals shown). 


in coverage, all analyses were performed on an equal number of 
randomly selected sequences (200 full-length, 1,000 V2 and 10,000 
V6). At this level of coverage, there was little overlap between the 
sampled faecal communities. Moreover, the number of 16S rRNA 
gene sequences belonging to each phylotype varied greatly between 
faecal microbiotas (Supplementary Tables 6-8). 

Because this apparent lack of overlap could reflect the level of 
coverage (Supplementary Tables 1-3), we subsequently searched all 
hosts for bacterial phylotypes present at high abundance using a 
sampling model based on a combination of standard Poisson and 
binomial sampling statistics. The analysis allowed us to conclude that 
no phylotype was present at more than about 0.5% abundance in all 
of the samples in this study (see Supplementary Results). Finally, we 
sub-sampled our data set by randomly selecting 50—3,000 sequences 
per sample; again, no phylotypes were detectable in all individuals 
sampled within this range of coverage (Supplementary Fig. 3). 

Samples taken from the same individual at the initial collection 
point and 57 + 4 days later were consistent with respect to the specific 
phylotypes found (Supplementary Figs 4 and 5), but showed varia- 
tions in relative abundance of the major gut bacterial phyla 
(Supplementary Fig. 6). There was no significant association between 
UniFrac distance and the time between sample collections. Overall, 
faecal samples from the same individual were much more similar to 
one another than samples from family members or unrelated indi- 
viduals (Fig. 1a), demonstrating that short-term temporal changes in 
community structure within an individual are minor compared with 
inter-personal differences. 
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Analysis of 16S rRNA data sets produced by the three PCR-based 
methods, plus shotgun sequencing of community DNA (see below), 
revealed a lower proportion of Bacteroidetes and a higher proportion 
of Actinobacteria in obese compared with lean individuals of both 
ancestries (Supplementary Table 9). Combining the individual P 
values across these independent analyses using Fisher’s method dis- 
closed significantly fewer Bacteroidetes (P= 0.003), more 
Actinobacteria (P=0.002) but no significant difference in 
Firmicutes (P= 0.09). These findings agree with previous work 
showing comparable differences in both taxa in mice’ and a progress- 
ive increase in the representation of Bacteroidetes when 12 unrelated, 
obese humans lost weight after being placed on one of two reduced- 
calorie diets®. 

Across all methods, obesity was associated with a significant 
decrease in the level of diversity (Fig. 1b and Supplementary Fig. 
lc-f). This reduced diversity suggests an analogy: the obese gut 
microbiota is not like a rainforest or reef, which are adapted to high 
energy flux and are highly diverse; rather, it may be more like a 
fertilizer runoff where a reduced-diversity microbial community 
blooms with abnormal energy input’®. 

We subsequently characterized the microbial lineage and gene 
content of the faecal microbiomes of 18 individuals representing 
six of the families (three lean and three obese European ancestry 
monozygotic twin pairs and their mothers) through shotgun pyro- 
sequencing (Supplementary Tables 4 and 5) and BLASTX compar- 
isons against several databases (KEGG” (version 44) and STRING"*) 
plus a custom database of 44 reference human gut microbial genomes 
(Supplementary Figs 7-10 and Supplementary Results). Our analysis 
parameters were validated using control data sets comprising ran- 
domly fragmented microbial genes with annotations in the KEGG 
database’ (Supplementary Fig. 11 and Supplementary Methods). 
We also tested how technical advances that produce longer reads 
might improve these assignments by sequencing faecal community 
samples from one twin pair using Titanium pyrosequencing methods 
(average read length of 341 + 134 nucleotides (s.d.) versus 208 + 68 
nucleotides for the standard FLX method). Supplementary Fig. 12 
shows that the frequency and quality of sequence assignments is 
improved as read length increases from 200 to 350 nucleotides. 

The 18 microbiomes were searched to identify sequences matching 
domains from experimentally validated carbohydrate-active 
enzymes (CAZymes). Sequences matching 156 total CAZy families 
were found within at least one human gut microbiome, including 77 
glycoside hydrolase, 21 carbohydrate-binding module, 35 glycosyl- 
transferase, 12 polysaccharide lyase and 11 carbohydrate-esterase 
families (Supplementary Table 10). On average, 2.62 + 0.13% of 
the sequences in the gut microbiome could be assigned to 
CAZymes (a total of 217,615 sequences), a percentage that is greater 
than the most abundant KEGG pathway (‘Transporters’; 
1.20 + 0.06% of the filtered sequences generated from each sample) 
and indicative of the abundant and diverse set of microbial genes 
directed towards accessing a wide range of polysaccharides. 

Category-based clustering of the functions from each microbiome 
was performed using principal components analysis (PCA) and hier- 
archical clustering’’. Two distinct clusters of gut microbiomes were 
identified based on metabolic profile, corresponding to samples with 
an increased abundance of Firmicutes and Actinobacteria, and sam- 
ples with a high abundance of Bacteroidetes (Fig. 2a). A linear regres- 
sion of the first principal component (PC1, explaining 20% of the 
functional variance) and the relative abundance of the Bacteroidetes 
showed a highly significant correlation (R* = 0.96, P<107'’; 
Fig. 2b). Functional profiles stabilized within each individual’s 
microbiome after 20,000 sequences had been accumulated 
(Supplementary Fig. 13). Family members had more similar profiles 
than unrelated individuals (Fig. 2c), suggesting that shared bacterial 
community structure (‘who’s there’ based on 16S rRNA analyses) 
also translates into shared community-wide relative abundance of 
metabolic pathways. Accordingly, a direct comparison of functional 
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Figure 2 | Metabolic-pathway-based clustering and analysis of the human 
gut microbiome of monozygotic twins. a, Clustering of functional profiles 
based on the relative abundance of KEGG metabolic pathways. All pairwise 
comparisons were made of the profiles by calculating each R’ value. Sample 
identifier nomenclature: family number, twin number or mother, and BMI 
category (Le, lean; Ov, overweight; Ob, obese; for example, F1T1Le stands 
for family 1, twin 1, lean). b, The relative abundance of Bacteroidetes as a 
function of the first principal component derived from an analysis of KEGG 
metabolic profiles. c, Comparisons of functional similarity between twin 
pairs, between twins and their mother, and between unrelated individuals. 
Asterisk indicates significant differences (Student’s t-test with Monte Carlo; 
P<0.01; mean + s.e.m.). 


and taxonomic similarity (see Supplementary Methods) disclosed a 
significant association: individuals with similar taxonomic profiles 
also share similar metabolic profiles (P< 0.001; Mantel test). 

Functional clustering of phylum-wide sequence bins representing 
microbiome reads assigned to 23 human gut Firmicutes and 14 
Bacteroidetes reference genomes showed discrete clustering by 
phylum (Supplementary Figs 14a and 15). Bootstrap analyses of the 
relative abundance of metabolic pathways in the microbiome-derived 
Firmicutes and Bacteroidetes sequence bins disclosed 26 pathways 
with a significantly different relative abundance (Supplementary 
Fig. 14a). The Bacteroidetes bins were enriched for several carbohyd- 
rate metabolism pathways, whereas the Firmicutes bins were enriched 
for transport systems. This finding is consistent with our CAZyme 
analysis, which revealed a significantly higher relative abundance of 
glycoside hydrolases, carbohydrate-binding modules, glycosyltrans- 
ferases, polysaccharide lyases and carbohydrate esterases in the 
Bacteroidetes sequence bins (Supplementary Fig. 14b). 

One of the major goals of the International Human Microbiome 
Project(s) is to determine whether there is an identifiable ‘core 
microbiome’ of shared organisms, genes or functional capabilities 
found in a given body habitat of all or the vast majority of humans’. 
Although all of the 18 gut microbiomes surveyed showed a high level 
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of B-diversity with respect to the relative abundance of bacterial phyla 
(Fig. 3a), analysis of the relative abundance of broad functional cat- 
egories of genes and metabolic pathways (KEGG) revealed a generally 
consistent pattern regardless of the sample surveyed (Fig. 3b and 
Supplementary Table 11): the pattern is also consistent with results 
we obtained from a meta-analysis of previously published gut micro- 
biome data sets from nine adults’! (Supplementary Fig. 16). This 
consistency is not simply due to the broad level of these annotations, 
as a similar analysis of Bacteroidetes and Firmicutes reference gen- 
omes revealed substantial variation in the relative abundance of each 
category (see Supplementary Fig. 17). Furthermore, pairwise com- 
parisons of metabolic profiles obtained from the 18 microbiomes in 
this study revealed an average value of R* of 0.97 + 0.002 (Fig. 2a), 
indicating a high level of functional similarity. 

Overall functional diversity was compared using the Shannon 
index”’, a measurement that combines diversity (the number of dif- 
ferent metabolic pathways) and evenness (the relative abundance of 
each pathway). The human gut microbiomes surveyed had a stable 
and high Shannon index value (4.63 + 0.01), close to the maximum 
possible level of functional diversity (5.54; see Supplementary 
Methods). Despite the presence of a small number of abundant meta- 
bolic pathways (listed in Supplementary Table 11), the overall func- 
tional profile of each gut microbiome is quite even (Shannon evenness 
of 0.84 + 0.001 ona scale of 0-1), demonstrating that most metabolic 
pathways are found at a similar level of abundance. Interestingly, the 
level of functional diversity in each microbiome was significantly 
linked to the relative abundance of the Bacteroidetes (R* = 0.81, 
P< 10 °); microbiomes enriched for Firmicutes/Actinobacteria had 
a lower level of functional diversity. This observation is consistent with 
an analysis of simulated metagenomic reads generated from each of 36 
Bacteroidetes and Firmicutes genomes (Supplementary Fig. 18): on 
average, the Bacteroidetes genomes have a significantly higher level of 
both functional diversity and evenness (Mann—Whitney U-test, 
P<0.01). 

At a finer level, 26-53% of ‘enzyme’-level functional groups 
(KEGG/CAZy/STRING) were shared across all 18 microbiomes, 
whereas 8-22% of the groups were unique to a single microbiome 
(Supplementary Fig. 19a—c). The ‘core’ functional groups present in 
all microbiomes were also highly abundant, representing 93-98% of 
the total sequences. Given the higher relative abundance of these ‘core’ 
groups, more than 95% were found after 26.11 + 2.02 megabases of 
sequence were collected from a given microbiome, whereas the ‘vari- 
able’ groups continued to increase substantially with each additional 
megabase of sequence. Of course, any estimate of the total size of the 
core microbiome will depend on sequencing effort, especially for 
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Figure 3 | Comparison of taxonomic and functional variations in the human 
gut microbiome. a, Relative abundance of major phyla across 18 faecal 
microbiomes from monozygotic twins and their mothers, based on BLASTX 
comparisons of microbiomes and the National Center for Biotechnology 
Information non-redundant database. b, Relative abundance of categories of 
genes across each sampled gut microbiome (letters correspond to categories 
in the COG database). 
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functional groups found at a low abundance. On average, our survey 
achieved more than 450,000 sequences per faecal sample, which, 
assuming an even distribution, would allow us to sample groups 
found at a relative abundance of 10° *. To estimate the total size of 
the core microbiome based on the 18 individuals, we randomly sub- 
sampled each microbiome in 1,000 sequence intervals (Supple- 
mentary Fig. 19d). Based on this analysis, the core microbiome is 
approaching a total of 2,142 total orthologous groups (one site bind- 
ing (hyperbola) curve fit, R? = 0.9966), indicating that we identified 
93% of functional groups (defined by STRING) found within the core 
microbiome of the 18 individuals surveyed. Of these core groups, 71% 
(CAZy), 64% (KEGG) and 56% (STRING) were also found in the 
nine previously published, but much lower coverage, data sets gener- 
ated by capillary sequencing of adult faecal DNA*®”’ (average of 
78,413 + 2,044 bidirectional reads per sample; see Supplementary 
Methods). 

Metabolic reconstructions of the ‘core’ microbiome revealed sig- 
nificant enrichment for several expected functional categories, 
including those involved in transcription and translation (Fig. 4). 
Metabolic profile-based clustering indicated that the representation 
of ‘core’ functional groups was highly consistent across samples 
(Supplementary Fig. 20), and included several pathways that are 
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Figure 4 | KEGG categories enriched or depleted in the core versus variable 
components of the gut microbiome. Sequences from each of the 18 faecal 
microbiomes were binned into the ‘core’ or ‘variable’ microbiome based on 
the co-occurrence of KEGG orthologous groups (core groups were found in 
all 18 microbiomes whereas variable groups were present in fewer (<18) 
microbiomes; see Supplementary Fig. 19a). Asterisks indicate significant 
differences (Student’s t-test, *P < 0.05, **P < 0.001, ***P<10 >; 

mean + s.e.m.). 
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likely important for life in the gut, such as those for carbohydrate 
and amino-acid metabolism (for example, fructose/mannose meta- 
bolism, amino-sugar metabolism and N-glycan degradation). 
Variably represented pathways and categories include cell motility 
(only a subset of Firmicutes produce flagella), secretion systems and 
membrane transport (for example, phosphotransferase systems 
involved in the import of nutrients, including sugars; Fig. 4 and 
Supplementary Fig. 20). 

The distribution of CAZy glycoside hydrolase and glycosyltrans- 
ferase families was compared between each pair of microbiomes (see 
Supplementary Table 10 for CAZy families with a relative abundance 
greater than 1%). This analysis revealed that all individuals had a 
similar profile of glycosyltransferases (R* = 0.96 + 0.003), whereas 
the profiles of glycoside hydrolases were significantly more variable, 
even between family members (R? = 0.80 + 0.01; P< 10~*°, paired 
Student’s t-test). This suggests that the number and spectrum of 
glycoside hydrolases is affected by ‘external’ factors such as diet more 
than the glycosyltransferases. 

To identify metabolic pathways associated with obesity, only non- 
core associated (variable) functional groups were included in a com- 
parison of the gut microbiomes of lean versus obese twin pairs. A 
bootstrap analysis’* was used to identify metabolic pathways that 
were enriched or depleted in the variable obese gut microbiome. 
For example, similar to a mouse model of diet-induced obesity’, 
the obese human gut microbiome was enriched for phosphotransfer- 
ase systems involved in microbial processing of carbohydrates 
(Supplementary Table 12). All gut microbiome sequences were com- 
pared with the custom database of 44 human gut genomes: an odds 
ratio analysis revealed 383 genes that were significantly different 
between the obese and lean gut microbiome (q value < 0.05; 273 
enriched and 110 depleted in the obese microbiome; 
Supplementary Tables 13 and 14). By contrast, only 49 genes were 
consistently enriched or depleted between all twin pairs (see 
Supplementary Methods). 

These obesity-associated genes were representative of the taxo- 
nomic differences described above: 75% of the obesity-enriched 
genes were from Actinobacteria (compared with 0% of lean-enriched 
genes; the other 25% are from Firmicutes) whereas 42% of the lean- 
enriched genes were from Bacteroidetes (compared with 0% of the 
obesity-enriched genes). Their functional annotation indicated that 
many are involved in carbohydrate, lipid and amino-acid metabol- 
ism (Supplementary Tables 13 and 14). Together, they comprise an 
initial set of microbial biomarkers of the obese gut microbiome. 

Our finding that the gut microbial community structures of adult 
monozygotic twin pairs had a degree of similarity that was compar- 
able to that of dizygotic twin pairs, and only slightly more similar 
than that of their mothers, is consistent with an earlier fingerprinting 
study of adult twins”, and with a recent microarray-based analysis, 
which revealed that gut community assembly during the first year of 
life followed a more similar pattern in a pair of dizygotic twins than 
12 unrelated infants”. Intriguingly, another fingerprinting study of 
monozygotic and dizygotic twins in childhood showed a slightly 
reduced similarity profile in dizygotic twins”’. Thus, comprehensive 
time-course studies, comparing monozygotic and dizygotic twin 
pairs from birth through adulthood, as well as intergenerational 
analyses of their families’ microbiotas, will be key to determining 
the relative contributions of host genotype and environmental expo- 
sures to (gut) microbial ecology. 

The hypothesis that there is a core human gut microbiome, defin- 
able by a set of abundant microbial organismal lineages that we all 
share, may be incorrect: by adulthood, no single bacterial phylotype 
was detectable at an abundant frequency in the guts of all 154 
sampled humans. Instead, it appears that a core gut microbiome 
exists at the level of shared genes, including an important component 
involved in various metabolic functions. This conservation suggests a 
high degree of redundancy in the gut microbiome and supports an 
ecological view of each individual as an ‘island’ inhabited by unique 
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collections of microbial phylotypes: as in actual islands, different 
species assemblages converge on shared core functions provided by 
distinctive components. Our findings raise the question of how core 
functionality is assembled in this body habitat. Understanding the 
underlying principles should provide insights about microbial 
adaptation to, and mutualistic community assembly within, a wide 
range of environments. 


METHODS SUMMARY 


Faecal samples were collected from each individual. Community DNA was pre- 
pared and used for pyrosequencing (454 Life Sciences), as well as for PCR and 
sequencing of bacterial 16S rRNA genes. Shotgun reads were mapped to ref- 
erence genomes using the National Center for Biotechnology Information ‘non- 
redundant’ database, KEGG"’, STRING"’, CAZy (http://www.cazy.org/) and a 
44-member human-gut microbial genome database. Metabolic reconstructions 
were performed based on CAZy, KEGG and STRING annotations. The relative 
abundance of KEGG metabolic pathways is referred to as a ‘metabolic profile’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Community DNA preparation. Faecal samples were frozen immediately after 
they were produced. De-identified samples were stored at — 80 °C before proces- 
sing. Ten to twenty grams of each sample was pulverized in liquid nitrogen with a 
mortar and pestle. An aliquot (approximately 500 mg) of each sample was then 
suspended, while frozen, in a solution containing 500 ul of extraction buffer 
(200 mM Tris (pH 8.0), 200mM NaCl, 20mM EDTA), 210 pl of 20% SDS, 
500 ul of a mixture of phenol:chloroform:isoamyl! alcohol (25:24:1, pH 7.9), 
and 500 ul of a slurry of 0.1-mm diameter zirconia/silica beads (BioSpec 
Products). Microbial cells were subsequently lysed by mechanical disruption 
with a bead beater (BioSpec Products) set on high for 2 min at room temper- 
ature, followed by extraction with phenol:chloroform:isoamyl alcohol, and pre- 
cipitation with isopropanol. DNA obtained from three separate aliquots of each 
faecal sample were pooled (=200 ug DNA) and used for pyrosequencing (see 
below). 

16S rRNA gene-sequence-based surveys. Complementary phylogenetic- and 
taxon-based methods were used to compare 16S rRNA sequences among faecal 
communities. Phylogenetic clustering with UniFrac’” is based on the principle that 
communities can be compared in terms of their shared evolutionary history, as 
measured by the degree to which they share branch length on a phylogenetic tree. 
We complemented this approach with taxon-based methods”, which disregard 
some of the information contained in the phylogenetic tree of the taxa in question, 
but have the advantage that specific taxa unique to, or shared among, groups of 
samples can be identified (for example, those from lean or obese individuals). 
Before both types of analysis, we grouped 16S rRNA gene sequences into opera- 
tional taxonomic units (OTUs/phylotypes) using both cd-hit”* and the furthest- 
neighbour-like algorithm, with a sequence identity threshold of 97%, which is 
commonly used to define ‘species’-level phylotypes. Taxonomy was assigned using 
the best-BLAST-hit against Greengenes” (E value cutoff of 10” '°, minimum 88% 
coverage, 88% identity) and the Hugenholtz taxonomy (downloaded from 
http://greengenes.Ibl.gov/Download/Sequence_Data/Greengenes_format/ on 12 
May 2008, excluding sequences annotated as chimaeric). 

Selection of operational taxonomic units. 16S rRNA gene-derived pyrosequen- 
cing data were pre-processed to remove sequences with low-quality scores, 
sequences with ambiguous characters or sequences outside the length bounds 
(V6 <50 nucleotides, V2 < 200 nucleotides), and binned according to sample- 
specific barcode (see, for example, ref. 13). Similar sequences were identified 
using Megablast*® and cd-hit, with the following parameters: E value 10° '° 
(Megablast only); minimum coverage 99%; minimum pairwise identity 97%. 
Candidate OTUs were identified as sets of sequences connected to each other at 
this level using a maximum of 4,000 hits per sequence. Each candidate OTU was 
considered valid if the average density of connection was above threshold; other- 
wise, it was broken up into smaller connected components”’. 

Tree building and UniFrac clustering for PCA analysis. A relaxed neighbour- 
joining tree was built from one representative sequence per OTU using 
Clearcut*', employing the Kimura correction (the PH Lane mask was applied 
to V2 and full-length data), but otherwise with default comparisons. Unweighted 
UniFrac’? was run using the resulting tree. PCA was performed on the resulting 
matrix of distances between each pair of samples. To determine if the UniFrac 
distances were on average significantly different for pairs of samples (that is, 
between twin pairs, between twins and their mother, or between unrelated indi- 
viduals), we performed a t-test on the UniFrac distance matrix, and generated a P 
value for the statistic by permutation of the rows and columns as in the Mantel 
test, regenerating the t-statistic for 1,000 random samples, and using the distri- 
bution to obtain an empirical P value. 

Rarefaction and phylogenetic diversity measurements. To determine which 
individuals had the most diverse communities of gut bacteria, rarefaction plots 
and phylogenetic diversity measurements, as described by Faith*’, were made for 
each sample. Phylogenetic diversity is the total amount of branch length in a 
phylogenetic tree constructed from the combined 16S rRNA data sets, leading to 
the sequences in a given sample. To account for differences in sampling effort 
between individuals, and to estimate how far we were from sampling the divers- 
ity of each individual completely, we plotted the accumulation of phylogenetic 
diversity (branch length) with sampling effort, in a manner analogous to 
rarefaction curves. We generated the phylogenetic diversity rarefaction curve 
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for each individual by applying custom python code (http://bmf2.colorado. 
edu/unifrac/about.psp) to the Arb parsimony insertion tree’’. 

Pyrosequencing of total community DNA. Shotgun sequencing runs were per- 
formed on the 454 FLX pyrosequencer from total faecal community DNA. Two 
samples were also analysed in a single run using Titanium extra-long-read pyr- 
osequencing technology (see Supplementary Tables 4 and 5). Sequencing reads 
with degenerate bases (‘Ns’) were removed along with all duplicate sequences, as 
sequences of identical length and content are a common artefact of the pyrose- 
quencing methodology. Finally, human sequences were removed by identifying 
sequences homologous to the Homo sapiens reference genome (BLASTN 
E<10-°, Yidentity > 75, score > 50). 

CAZyme analysis. Metagenomic sequence reads were searched against a library 
of modules derived from all entries in the carbohydrate-active enzymes (CAZy) 
database (www.cazy.org using FASTY*, E<10 °). This library consists of 
approximately 180,000 previously annotated modules (catalytic modules, 
carbohydrate-binding modules and other non-catalytic modules or domains 
of unknown function) derived from about 80,000 protein sequences. The num- 
ber of sequencing reads matching each CAZy family was divided by the number 
of total sequences assigned to CAZymes and multiplied by 100 to calculate a 
relative abundance. An R* value was calculated for each pair of CAZy profiles. 
We then compared the distribution of glycoside hydrolase similarity scores with 
the distribution of glycosyltransferase similarity scores. 

Statistical analyses. Xipe** (version 2.4) was used for bootstrap analyses of 
pathway enrichment and depletion, using the parameters sample size = 10,000 
and confidence level = 0.95. Linear regressions were performed in Excel (version 
11.0, Microsoft). Mann—Whitney and Student’s t-tests were used to identify 
statistically significant differences between two groups (Prism version 4.0, 
GraphPad; Excel version 11.0, Microsoft). The Bonferroni correction was used 
to correct for multiple hypotheses. The Mantel test was used to compare distance 
matrices: the matrix of each pairwise comparison of the abundance of each 
reference genome, and the abundance of each metabolic pathway, were com- 
pared (Mantel program in Python using PyCogent™; 10,000 replicates). Data are 
represented as mean = s.e.m. unless otherwise indicated. 

Microbiome sequences were compared against the custom database of 44 gut 
genomes (BLASTX E< 10°, bitscore > 50, and %identity > 50). A gene-by- 
sample matrix was then screened to identify genes ‘commonly-enriched’ in 
either the obese or lean gut microbiome (defined by an odds ratio greater than 
2 or less than 0.5 when comparing the pooled obese twin microbiomes with the 
pooled lean twin microbiomes, and when comparing each individual obese twin 
microbiome with the aggregate lean twin microbiome, or vice versa). The stati- 
stical significance of enriched or depleted genes was then calculated using a 
modified t-test (q value < 0.05; calculated with code supplied by M. Pop and 
J.R. White, University of Maryland). We also searched for genes that were con- 
sistently enriched or depleted in all six monozygotic twin pairs. A gene-by- 
sample matrix was generated based on BLASTX comparisons of each micro- 
biome with our custom 44-genome database, to calculate an odds ratio based on 
the frequency of each gene in each twin versus the respective co-twin. The 
analysis revealed only 49 genes (odds ratio > 2 or < 0.5): they represent a variety 
of taxonomic groups, including Firmicutes, Bacteroidetes and Actinobacteria, 
and did not show any clear functional trends. 
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Protein kinase R reveals an evolutionary model for 


defeating viral mimicry 


Nels C. Elde’, Stephanie J. Child*, Adam P. Geballe?** & Harmit S. Malik’ 


Distinguishing self from non-self is a fundamental biological chal- 
lenge. Many pathogens exploit the challenge of self discrimination 
by employing mimicry to subvert key cellular processes including 
the cell cycle, apoptosis and cytoskeletal dynamics'°. Other 
mimics interfere with immunity®’. Poxviruses encode K3L, a 
mimic of eIF2a, which is the substrate of protein kinase R 
(PKR), an important component of innate immunity in verte- 
brates*’. The PKR-K3L interaction exemplifies the conundrum 
imposed by viral mimicry. To be effective, PKR must recognize a 
conserved substrate (eIF2a) while avoiding rapidly evolving sub- 
strate mimics such as K3L. Using the PKR-K3L system and a 
combination of phylogenetic and functional analyses, we uncover 
evolutionary strategies by which host proteins can overcome mim- 
icry. We find that PKR has evolved under intense episodes of 
positive selection in primates. The ability of PKR to evade viral 
mimics is partly due to positive selection at sites most intimately 
involved in eIF2a recognition. We also find that adaptive changes 
on multiple surfaces of PKR produce combinations of substitu- 
tions that increase the odds of defeating mimicry. Thus, although 
it can seem that pathogens gain insurmountable advantages by 
mimicking cellular components, host factors such as PKR can 
compete in molecular ‘arms races’ with mimics because of evolu- 
tionary flexibility at protein interaction interfaces challenged by 
mimicry. 

To counteract viral infections, PKR phosphorylates the translation 
initiation factor eIF2x in the presence of double-stranded RNA 
(dsRNA) from viruses*’. This activity strongly inhibits protein syn- 
thesis and blocks the production of new virus particles. The crucial 
function of PKR in innate immunity is reflected by the evolution of 
numerous factors from various viruses that disable PKR to promote 
viral production", including a poxvirus-encoded mimic of elF2a 
called K3L (Supplementary Fig. 1). Host proteins such as PKR, which 
interact directly with viral antagonists such as K3L, can be subject to 
molecular ‘arms-races’ in which amino-acid substitutions that directly 
affect interactions can be rapidly fixed by positive selection'!”’. 

To determine whether PKR might be subject to positive selection, we 
cloned and sequenced complementary DNA of PKR from a panel of 20 
primates representing more than 30 million years of evolutionary 
divergence. By considering ratios of the rates of non-synonymous 
(dN) and synonymous (dS) substitutions, we found evidence for 
ancient, episodic positive selection in primate lineages (P< 0.0003; 
Fig. la and Supplementary Table 1). In particular, one branch in Old 
World monkeys was calculated to have undergone 22 non-synonymous 
substitutions without any synonymous changes, one of the most 
intense episodes of positive selection reported for any primate gene 
(Supplementary Data). Likelihood ratio tests'’ using the entire phylo- 
geny reveal that 17% of codons evolved with an average dN/dS ratio of 
3.7, strongly supporting a finding of positive selection (P< 0.0001; 


Supplementary Tables 2 and 3), even after accounting for the poten- 
tially confounding effects of recombination and synonymous site vari- 
ation’* (P< 0.0001; Supplementary Tables 4 and 5). Positive selection 
is observed in each of the three domains of PKR—the dsRNA-binding 
domain, the spacer region and even the kinase domain—which is con- 
sistent with an extensive history of facing viral factors that directly bind 
PKR in these separate domains (Supplementary Fig. 1). Several residues 
in the kinase domain, which make direct contacts with eIF2 (ref. 15), 
are among the fastest-evolving residues in PKR (Fig. lb and 
Supplementary Fig. 1), suggesting that selective pressure to evade 
eIF2% mimics may have driven changes in these residues. 

Similarly, we find that positive selection has acted on the el[F2x 
mimic K3L (Supplementary Fig. 2). For instance, in a comparison of 
K3L from variola major (smallpox) and vaccinia viruses, we find a 
dN/dS ratio of 2.80 (P< 0.001), whereas fewer than 10% of ortho- 
logues in vaccinia and variola comparisons show any evidence of 
positive selection (average dN/dS = 0.10; N.C.E. and H.S.M., unpub- 
lished observations). This suggests that poxviral eIF2~ mimics have 
also undergone positive selection and reflects the possibility that K3L 
has not achieved or maintained an optimal state of mimicry. Instead, 
K3L might continually evolve to counter adaptive changes in PKR. 

In contrast with the rapid evolution of PKR, its substrate, eIF2, is 
essentially unchanged in simian primates at the amino-acid level 
(dN/dS = 0 in a comparison of human and rhesus). Thus, PKR must 
recognize an unchanging substrate while evolving to discriminate 
against mimics such as K3L to be effective. If we consider that most 
viruses, including poxviruses'®’’, evolve at faster rates than primates, 
such challenges by mimics are daunting for hosts. Nevertheless, PKR 
can inhibit viruses encoding elF2« mimics’®, suggesting that adaptive 
changes in PKR might help to overcome mimicry by these factors. 

We investigated whether primate PKR orthologues differ in their 
ability to discriminate against K3L from vaccinia, the model poxvirus. 
Because the entire clade of extant poxviruses is very young relative to 
the divergence between primates’®'’, we could not investigate strict 
co-evolutionary dynamics between PKR and K3L. Instead, we used 
vaccinia K3L as a means to study the evolutionary strategies afforded 
PKR for counteracting substrate mimics that were faced over the 
course of primate evolution, which could leave PKR variants either 
susceptible or resistant to vaccinia K3L. Even though primate PKR 
alleles did not necessarily evolve under pressure from vaccinia K3L, 
our approach allowed us to identify the mechanisms by which host 
proteins might defeat mimicry more generally. Examining host—virus 
evolution from a similar perspective led to the identification of a 
region in the restriction factor Trim5a that confers specificity against 
ancient, extinct retroviruses but fails to protect humans from HIV™*”’. 

A growth assay in yeast has provided a simple test of PKR func- 
tion”. Human PKR recognizes and phosphorylates yeast elF2, as a 
result of its high level of similarity to primate eIF2a, to cause growth 
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Figure 1| Widespread positive selection has shaped PKR throughout 
primate evolution. a, PKR was sequenced from simian primates that 
together represent more than 30 million years of divergence. dN/dS values 
along each branch of the phylogeny are listed, and those with dN/dS > 1 are 
highlighted in red. Branches with bold lines, overlapping the set in red, 
indicate lineages found to be under positive selection by complementary 
model fitting analysis (see also Supplementary Table 6). Values in 
parentheses are shown for branches where no synonymous changes were 
observed (S = 0) and indicate the number of non-synonymous changes (N). 
b, Sites under positive selection (red) are mapped onto a ribbons 
representation of the complex of the PKR kinase domain (blue) with elF2« 
(green) (PDB code 2A1A)"’. The active site of PKR is shown in orange, and 
for technical reasons a large portion of the 4-5 loop (dashed blue line) is 
invisible from the structure deduced from the co-crystal’*. Residues under 
positive selection near the interface of PKR with eI[F2 and K3L are noted in 
the 84-85 loop (Thr 336, Asp 338, Ser 344, Ser 351) and the xD (Gln 376, 


arrest’**". We expressed ten divergent primate PKR cDNAs in yeast to 
determine whether they differed in their ability to phosphorylate 
eIF2a. All primate PKR genes tested caused consistent levels of 
growth arrest, which specifically depended on phosphorylation of 
elF2a (ref. 22) (Fig. 1c, middle). However, co-expression with vac- 
cinia virus K3L uncovered marked differences in K3L inhibition of 
primate PKR orthologues, which leads to a rescue of growth” (Fig. 1c, 
right). PKR alleles from Old World and New World monkeys, and 
from white-cheeked gibbon, were generally quite susceptible to sup- 
pression of growth arrest by K3L from vaccinia and variola, whereas 
other hominoid PKR alleles showed only modest suppression by K3L 
(Fig. 1c and Supplementary Fig. 3). Thus, rapid evolution of primate 
PKR did not seem to alter eIF2x recognition significantly, but 
resulted in considerable differences in susceptibility to K3L. In par- 
ticular, we find in the hominoid lineage that human, chimp, gorilla 
and orangutan PKR orthologues are 1,000-fold more resistant than 
gibbon PKR to growth rescue by K3L. 
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Lys 380) and «G (Phe 489, Ser 492, Thr 496) helices. c, Plasmids encoding 
PKR variants from a panel of primates under pGal were introduced into 
yeast strains HM3 (eIF2«), HM2 (eIF2« and haemagglutinin (HA)-epitope- 
tagged vaccinia K3L) and J223 (eIF2«-S51A). Tenfold serial dilutions of 
transformants were spotted on plates containing either glucose or galactose 
(see Methods). Immunoblot analysis of PKR (top panel) and HA-K3L 
(bottom panel) is also shown (see Methods). For African green monkey 
(AGM), resistance to K3L might reflect differences in PKR expression in 
yeast. d, Primary fibroblasts from the indicated primates were infected in 
triplicate with wild-type (filled columns) or AK3L (open columns) vaccinia 
virus (multiplicity of infection 0.001). Virus production was assessed three 
days after infection by titring cell lysates. The significance of wild-type virus 
compared with AK3L is indicated (Student’s t-test; error bars show s.d.). 
Minor variations of this experiment (not shown) revealed that AK3L 
infections typically produced about fivefold less virus than wild-type virus in 
gibbon cells. 


We further corroborated the large differences in K3L susceptibility 
uncovered by the yeast assay by infecting human, orangutan and 
gibbon fibroblast cell lines with either wild-type vaccinia virus or a 
strain with a K3L gene deletion (AK3L). Consistent with our yeast 
assays and previous reports in human cells** was our finding that 
AK3L virus had no significant effect on viral titre in human or orang- 
utan cells but led to a substantial decrease in titre in gibbon cells 
(Fig. 1d). Vaccinia virus therefore depends on K3L for full infectivity 
in gibbon cells, where PKR is susceptible to K3L. 

We wished to map critical genetic differences between ‘resistant’ 
and ‘susceptible’ PKR alleles to understand the basis of resistance to 
K3L. We first investigated helix «G of the kinase domain because 
residues 489, 492 and 496 have key functions in the recognition of 
eIF2a (ref. 15), yet they have evolved under recurrent positive selec- 
tion (Figs 1b and 2). Whereas gibbon PKR (helix «G: Tyr 489- 
Ala 492-Thr 496 or Y-A-T) is susceptible to K3L in growth assays, 
the human «G configuration (F-S-T) in an otherwise gibbon PKR 
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backbone increases the resistance of gibbon PKR to vaccinia K3L 
(Fig. 2a, rows 1 and 2). In fact, the A492S substitution (Y-S-T) alone 
confers on gibbon PKR greatly increased resistance to K3L (Fig. 2a, 
row 3). These findings reveal that even a single change in PKR at the 
common interface with substrate and mimic has the capacity to 
reverse a ‘susceptibility’ phenotype. 

A second determinant, not in helix «G, explains the resistance of 
orangutan PKR to K3L. When we tested the «G configuration (S-A- 
K) of the ‘resistant’ orangutan PKR allele (Fig. 1c) in the gibbon 
backbone, this S-A-K allele was still quite susceptible (Fig. 2a, row 
5). To identify the source of resistance of orangutan PKR, we tested 
chimaeras between orangutan and gibbon PKR and found that a 
region in the kinase domain containing helices «D and wE from 
orangutan PKR greatly increased the resistance of gibbon PKR to 
K3L (data not shown). When we tested individual substitutions in 
this region, we found that the F394L substitution of the «E helix 
conferred gibbon PKR with resistance against K3L (Fig. 2c). 
Importantly, the opposite L394F substitution greatly reduced resis- 
tance in orangutan PKR (Fig. 2c). Unlike helix «G, helix «E discrimi- 
nation seems to be independent of PKR contact with its substrate 
because it is positioned away from the eIF2a interface (Fig. 2d)’*. In 
addition, positive selection in helix «D suggested that this region 
could contribute to escaping mimicry, either directly or by virtue 
of co-evolution between helices «D and «%G (Supplementary Table 
9)*°. However, we did not find functional evidence for the involve- 
ment of xD in resisting K3L over the evolutionary timeframe that we 
examined for this particular mimic (Supplementary Fig. 4). 
Therefore, susceptible gibbon PKR alleles can gain resistance to vac- 
cinia K3L by single substitutions in either the «G helix or the aE helix 
(Fig. 2e), increasing the chances of escaping mimicry. 

Our analyses suggested that human PKR contained residues assoc- 
iated with increased resistance to K3L from both «G and aE helices. 
Indeed, we found that a human PKRallele carrying ‘susceptible’ muta- 
tions in both its wE (L394F) and «G (F489Y/S492A) helices loses wild- 
type resistance to K3L (Fig. 3a, row 5). We tested all combinations of 
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Figure 2 | Distinct surfaces of the PKR kinase domain are crucial to K3L 
resistance. a, Plasmids encoding gibbon PKR alleles with substitutions in 
the «G helix were introduced into yeast strains HM3 (eIF2« alone) and HM1 
(eIF2x and K3L). Tenfold serial dilutions of transformants are shown. A 
corresponding immunoblot analysis is also shown with antibodies against 
PKR (top) and K3L (bottom). b, A ribbon representation of the PKR-elF2« 
complex, highlighting the association of side chains of residues under 
positive selection with side chains of eI[F2«. Phe 489, Ser 492 and Thr 496 
form a face of the «G helix directly interacting with eIF2« (ref. 15). 
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resistant and susceptible substitutions at positions 394 (helix wE), 489 
and 492 (helix «G) in human PKR and found that six out of eight 
combinations of human PKRalleles resist K3L. The two exceptions are 
F-Y-A (described above) and F-F-A (Fig. 3a, row 4), which is only 
slightly more resistant than F-Y-A to K3L, revealing a weak effect 
associated with the positively selected residue at position 489. 
Although the human and gibbon PKR backbones bear similar out- 
comes at all positions (Fig. 3b), the ‘susceptible’ human alleles still 
seem more resistant than the ‘susceptible’ gibbon alleles to vaccinia 
K3L, hinting at an additional K3L resistance determinant in the 
human PKR sequence (data not shown). 

One of the most notable findings from testing the ‘susceptible’ and 
‘resistant’ PKR variants was that helices aE and «%G had distinct 
means of defeating K3L. Leu 394 resisted K3L regardless of whether 
human, gibbon or orangutan PKR had a ‘susceptible’ «G helix (Fig. 2c 
and Supplementary Fig. 5). The mutational profile of the E and «G 
helices is therefore not strictly independent, because helix «E masks 
aG in terms of K3L resistance. Only when helix aE is ‘susceptible’ 
does the configuration of «G matter. Residue 394 of helix «E toggles 
exclusively between leucine and phenylalanine at a much lower rate 
than residues of helix «G, not only in primates but also among 
mammals in general (Supplementary Fig. 6). Our finding that 
Leu 394 confers overriding resistance to vaccinia K3L strongly sug- 
gests that toggling can unmask potentially adaptive substitutions in 
the rapidly evolving «G helix. The fact that Phe 394 is fixed in numer- 
ous species, including the New World monkeys we sampled, suggests 
that phenylalanine rather than leucine might confer resistance 
against substrate mimics different from the two we tested in this 
study. Therefore, toggling at position 394 reveals how a single sub- 
stitution, in combination with positive selection in helix «G, might 
effectively increase the adaptive space that PKR can explore, greatly 
increasing the odds of defeating substrate mimics. 

Positive selection seems to be a major evolutionary driver of many 
host-pathogen interactions'’'**. Strong positive selection seen in 
both primate PKR and poxvirus K3L, and the presence of substrate 
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c, Plasmids encoding gibbon and orangutan PKR alleles with substitutions in 
the cE helix were introduced into yeast strains HM3 and HM1. Tenfold 
serial dilutions of transformants are shown, along with a corresponding 
immunoblot analysis. d, Residues under positive selection (Gln 376 and 
Lys 380) and residue Leu 394 from a ribbon representation of human PKR 
and eIF2a are shown"’. e, Diagram showing that single substitutions in 
either the «E or «G helices can confer resistance against vaccinia K3L to 
gibbon PKR. 
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Figure 3 | PKR chimaeras reveal masking of K3L sensitivity by Leu 394. 
a, Tenfold serial dilutions of transformants expressing alleles of human PKR 
with combinations of substitutions in the wE and aG helices are shown, 
along with a corresponding immunoblot analysis. b, Phenotype ‘cubes’ 
summarizing the K3L susceptibility of alleles with all combinations of 
substitutions between human and gibbon PKR at positions 394, 489 and 492 
from Figs 2a and 3a and Supplementary Fig. 5. Red and blue dots indicate 
resistance and sensitivity to K3L, respectively. With the exception of F-F-A, 
which shows some measure of resistance to K3L in the human background 
(indicated by the red crescent), each set of substitutions has similar 
phenotypes in the human and gibbon backgrounds. Each single substitution 


mimics in unrelated viruses’’, clearly indicates that both host and 
viral genomes have been under intense pressure to gain advantages in 
these ancient and continuing evolutionary battles. The positive selec- 
tion we observed in primate PKR is likely to reflect selection driven by 
ancient viruses with K3L-like factors that strongly influenced sus- 
ceptibility to present-day mimics. For example, positive selection in 
the gibbon lineage driven by ancient mimics may have left gibbon 
PKR susceptible to vaccinia K3L. Similar trade-offs have been 
observed for variants of antiviral proteins under strong positive selec- 
tion that might have defeated ancient retroviruses but are currently 
susceptible to HIV-1 (ref. 19). 

Mimicry adds a layer of complexity to host—pathogen interfaces. 
Because PKR must distinguish an essentially unchanging substrate 
from rapidly evolving mimics such as K3L, it is surprising that most 
present-day hominoid species are resistant to vaccinia K3L (Fig. Ic). 
Our studies reveal evolutionary mechanisms that might allow host 
genes such as PKR to stave off mimicry. This strategy involves not 
only positive selection but also multiple discrimination interfaces 
(aE and «G helices) and a combinatorial outcome of resistance or 
susceptibility based on these surfaces, which together can increase 
discrimination against rapidly evolving mimics. 

PKR seems well suited for molecular arms races against mimics 
because of a striking level of evolutionary flexibility. Because the 
biochemical activity of PKR depends on recognition of an unchan- 
ging substrate, strong purifying selection at the interaction interface 
would be expected. Indeed, other members of the eIF2« kinase 
family, which do not primarily serve antiviral functions and are 
not known to encounter viral mimicry directly, have highly con- 
served «G helices (Fig. 3c) and evolve under purifying selection 
(dN<dS; Supplementary Fig. 1). Despite extensive amino-acid 
diversity in helix «G, variants of PKR retain the ability to recognize 
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in wild-type human PKR results in a variant still resistant to K3L, whereas in 
two of three cases gibbon PKR becomes resistant (indicated by arrows). 

c, Sequence alignments of the «G helix for each member of the eIF2 kinase 
family (PKR, haem-regulated inhibitor (HRI), PKR-like ER kinase (PERK) 
and GCN2) from several mammals highlights the conservation of this region 
compared with the rapid evolution of PKR (black arrowheads indicate 
residues of the «G helix under positive selection in PKR). The frequency of 
substitutions in the panel at each position is indicated by a colour code 
(yellow for a single substitution, orange for a second, red for a third, and blue 
for a fourth), with the human sequence as a reference. Residues making 
contacts with eIF2« are indicated with lines below the PKR alignment. 


eIF2a. The contrasting evolutionary trajectories of helix «G in the 
family of eI[F2x kinases suggests that host factors challenged by 
mimics, such as PKR, rely on a high degree of flexibility to escape 
mimicry. We speculate that substantial selective pressures for distin- 
guishing substrate mimics may even result in substitutions causing a 
decrease in substrate recognition until potential compensatory 
mutations might arise. Consistent with this situation was our finding 
that introducing an ancestral helix «G or one from orangutan into 
PKR from gibbon resulted in slightly compromised substrate recog- 
nition (Fig. 2a, middle, rows 5 and 6; also see Supplementary Fig. 7, 
middle), yet full substrate recognition was restored for helix «G from 
orangutan in the context of the whole protein (Fig. 1c, middle, oran- 
gutan). Compromising one function to explore a greater adaptive 
landscape for another function is probably a theme for genetic gains 
of functional novelty”*”’. Because contending with viral mimicry can 
be essential for combating infectious disease, compromises to com- 
ponents of key cellular processes targeted by mimics’ might be a 
‘hidder’ evolutionary cost of such high-stakes genetic conflicts. 


METHODS SUMMARY 


Primate PKR cDNA was amplified, cloned, and sequenced from a panel of 
hominoids, as well as from Old World and New World monkeys. Poxvirus 
K3L sequences were retrieved from the Poxvirus Bioinformatics Resource 
Center (http://www.poxvirus.org). DNA sequences from each panel were 
aligned and used for phylogenetic and evolutionary analysis with the PAML” 
and HyPhy software packages. Structure observations of PKR were made with 
data coordinates from the Protein Databank (http://www.pdb.org; IDs 2A1A 
and 2A19) and MacPyMol software. 

Variants of PKR and K3L were cloned into yeast vectors, and gene expression 
was driven in transformed yeast strains under a galactose-induced promoter. 
Yeast growth was monitored in serial-dilution series of transformants on plates 
containing selective medium and galactose as a carbon source. Western blots with 
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antibodies raised against PKR, K3L and the haemagglutinin (HA) epitope were 
performed to determine protein levels for yeast strains used in growth assays. 

For virus infection assays, human, orangutan and white-cheeked gibbon 
fibroblasts were infected with 0.001 plaque-forming units per cell of wild-type 
or AK3L vaccinia virus (Copenhagen strain) for 1h. Virus production was 
assessed 72 h after infection by titring cell lysates. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Primate genomic sources. Total RNA was obtained (RNeasy kit; Qiagen) from 
the following cell lines from the Coriell Cell Repositories except where otherwise 
noted: Homo sapiens (human; primary human foreskin fibroblasts), Pan troglo- 
dytes (chimpanzee; AG06939), Pan paniscus (bonobo; AG05253), Gorilla gorilla 
(western lowland gorilla; AG05251), Pongo pygmaeus pygmaeus (Bornean oran- 
gutan; AG05252), Hylobates agilis albibarbis (agile gibbon; PR00773), Hylobates 
leucogenys (white-cheeked gibbon; PR01037), Hylobates syndactylus (island 
siamang; PR00722), Cercocebus atys (sooty mangabey; gift from C. Apetrei), 
Macaca mulatta (rhesus monkey; TF-1 cells), Miopithecus talapoin talapoin (tal- 
apoin monkey; PRO0716), Erythrocebus patas (patas monkey; AG06116), 
Cercopithecus aethiops (African green monkey; COS cells), Trachypithecus fran- 
coisi (Francois’ leaf monkey; PRO1099), Colobus guereza (colobus monkey; 
PRO0980), Lagothrix lagotricha (common woolly monkey; AG05356), Ateles 
geoffroyi (black-handed spider monkey; AG05352), Callicebus moloch (dusky titi; 
AG06115), Aotus trivirgatus (owl monkey; CRL-1556; American Type Culture 
Collection) and Saguinus labiatus (red-chested mustached tamarin; AG05308). 
cDNA cloning and sequences. RNA (50 ng) from each primate was used for RT-— 
PCR (SuperScript III; Invitrogen) with primers listed in Supplementary Table 
10. PCR products were TA-cloned into pCR2.1 (Invitrogen) and sequenced from 
three different clones. The human PKR variant we cloned was identical in 
sequence to the GenBank entry for this gene (NM002759). The PKR cDNA 
sequence from Callithrix jacchus (common marmoset) was obtained by means 
of Blat searches of the UCSC Genome browser with PKR sequences from other 
New World monkeys to aid in identifying exon/intron boundaries. Other mam- 
malian sequences of PKR, HRI, PERK and GCN2 were obtained from GenBank 
or by means of Blat searches of the UCSC Genome browser (http://genome.ucsc. 
edu). Vaccinia (Copenhagen) and variola (major) and other poxvirus K3L 
sequences were obtained from the Poxvirus Bioinformatics Resource Center 
(http://www. poxvirus.org). 

PKR variants were ligated into 2-um (pSB819; URA) and CEN (pSB146; URA) 

yeast pGAL expression plasmids by means of Xhol and Nofl sites introduced into 
PKR primers. K3L or amino-terminal HA-epitope-tagged K3L from vaccinia 
virus (Copenhagen strain) was amplified by PCR, TA cloned, sequenced for 
accuracy, and ligated by means of Xhol and Nofl sites into an integrating 
(pSB305; LEU) yeast expression plasmid into which a galactose promoter was 
also introduced. Variola major K3L sequence was synthesized by standard meth- 
ods (CelTek Genes) and subcloned in the same manner as K3L from vaccinia. For 
comparisons of helix «E the following mammalian sequences of PKR were 
obtained from GenBank: Mus musculus (mouse; NP_035293), Rattus norvegicus 
(rat, NP_062208), Oryctolagus cuniculus (rabbit, NP_001075682), Canis lupus 
familiaris (dog, NP_001041600), Equus caballus (horse, XP_001917876), Bos 
taurus (cow, NP_835210) and Sus scrofa (pig, NP_999484). 
Evolutionary analysis and structure observations. DNA sequences were 
aligned by using ClustalW with small indels trimmed on the basis of amino-acid 
comparisons (Supplementary Data). The generally accepted primate phylogeny 
(Fig. la) was used for evolutionary analysis, although a neighbour-joining tree 
generated from the alignment of PKR placed gorilla and owl monkey at different 
nodes (Supplementary Fig. 8). Parallel analysis with the PKR tree did not alter 
any results significantly (data not shown). Pairwise dN/dS analysis of elF2« 
kinases, elF2x and K3L were performed with K-estimator software’’. 
Maximum-likelihood analysis of the larger PKR data set was performed with 
codeml of the PAML software package'’. A free-ratio model allowing dN/dS 
variation along different branches of the phylogeny was employed to calculate 
dN/dS values between lineages. Two-ratio tests were performed with likelihood 
models comparing all branches fixed at dN/dS = 1 or an average dN/dS value 
from the whole tree applied to each branch to varying dN/dS values according to 
branch. Complementary analysis grouping lineages according to dN/dS values 
with multi-model inference (HyPhy software)** was also applied to the data set. 
We uncovered support for one recombination breakpoint in the data set by using 
the GARD program (HyPhy software; see Supplementary Table 5). 

To detect selection in PKR, the multiple alignment was fitted to either F3x4 or 
F61 codon frequency models. Likelihood ratio tests (LRTs) were performed by 
comparing the following site-specific models (NS sites): M1 (neutral) with M2 
(selection), M7 (neutral, f distribution of dN/dS < 1) with M8 (selection, beta 
distribution, dN/dS> 1 allowed), and M8a (neutral, with class of sites at dN/ 
dS = 1) with M8. Similar LRTs that also account for synonymous rate variation 
and recombination (PARRIS; HyPhy software) were performed. Co-evolution 
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analysis between PKR residues was also performed (Spidermonkey/BGM; 
HyPhy software). 

PAML analysis identified sets of amino acids with high posterior probabilities 

(more than 0.90) for positive selection by a Bayesian approach. Similar analysis 
identified amino acids under positive selection with the LRTs implemented in 
the SLAC, FEL and REL programs (HyPhy software). Amino acids under positive 
selection in the kinase domain were examined using the PKR-elF2 structure 
data coordinates available in the Protein Databank (PDB IDs 2A1A and 2A19; 
http://www.pdb.org)'° and MacPyMol software*. 
Yeast strains and growth assays. Standard techniques were used for culturing 
and transforming yeast strains**. Strain H2557 was provided by T. Dever’! and 
modified by integrating K3L or HA-K3L under the gal promoter at the leu2 
locus. Integration of K3L alleles in the resulting strains HM1 and HM2 were 
confirmed by PCR with primers flanking the Jeu2 locus. HM3 was generated 
from H2557 by transforming empty pSB305 linearized with EcoRV. Genotypes 
of these strains are shown in Supplementary Table 11. 

Strains HM1, HM2, HM3, HM4 and J223 (S51A allele of elF2a; provided by T. 
Dever)” were transformed with PKR variants in 2-um plasmid pSB819 for 
growth assays. Transformants were grown in YC-leu-ura medium (yeast com- 
plete minimal medium with amino acids) containing 2% glucose, then washed 
and plated in dilution series of Doo = 3.0, 0.3, 0.03, 0.003 with the use of a 
bacterial replicator (Aladin Enterprises) on YC-leu-ura medium containing 
either 2% glucose or 2% galactose and grown for 6 days (the human chimaera 
set shown in Fig. 3 and Supplementary Fig. 5 was grown for 10 days). Growth 
assays with PKR variants expressed from CEN plasmid pSB146 yielded consist- 
ent results (data not shown). 

Western blotting. Transformants were grown to saturation in YC-leu-ura med- 
ium with 2% glucose, then washed and diluted 1:50 in YC-leu-ura medium with 
2% galactose and grown for 15 h. Whole-cell lysates were prepared” and resolved 
by SDS-PAGE (12% Tris-glycine gel; Invitrogen). Proteins were transferred to 
nitrocellulose membranes and detected with anti-PKR antibody B-10 (1:1,000 
dilution; Santa Cruz Biotechnology), anti-HA.11 (1:1,000; Covance) or a mono- 
clonal antibody against K3L (1:2,000; a gift from J. Tartaglia). 

Primate PKR infection assays. Human, orangutan and white-cheeked gibbon 
fibroblasts were maintained in Eagle’s minimal essential medium with Earle’s 
salts and non-essential amino acids, supplemented with 10% fetal bovine serum 
(Gibco), penicillin-streptomycin (100 U ml ') and 2 mM L-glutamine. Vaccinia 
virus Copenhagen strain (VC2)*° and VVAK3L (ref. 37), both obtained from B. 
Jacobs, were propagated and titred in BSC4y cells. Growth and titration of VV 
stocks were performed essentially as described**, except that virus stocks were 
partly purified after cell lysis by centrifugation through a 36% sucrose cushion 
before resuspension in 1 mM Tris-HCl pH 9.0, division into aliquot, and storage 
at —70 °C. 

Human, orangutan and gibbon fibroblasts in triplicate wells were infected 
with VC2 or VVAK3L at 0.001 plaque-forming units per cell for 1h, washed 
twice, and re-fed with medium. At 72h after infection the infected cells were 
collected and freeze-thawed three times, and the resulting lysates were titred on 
BSCyp cells*”. 
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Endochondral ossification is required for 
haematopoietic stem-cell niche formation 


Charles K. F. Chan'*, Ching-Cheng Chen'*, Cynthia A. Luppen’*, Jae-Beom Kim’, Anthony T. DeBoer’, Kevin Wei’, 
Jill A. Helms’, Calvin J. Kuo®, Daniel L. Kraft’ & Irving L. Weissman! 


Little is known about the formation of niches, local micro-environ- 
ments required for stem-cell maintenance. Here we develop an in 
vivo assay for adult haematopoietic stem-cell (HSC) niche forma- 
tion’*. With this assay, we identified a population of progenitor 
cells with surface markers CD45~Tie2~ ay*CD105'Thyl1.1— 
(CD105*Thy1~) that, when sorted from 15.5 days post-coitum 
fetal bones and transplanted under the adult mouse kidney capsule, 
could recruit host-derived blood vessels, produce donor-derived 
ectopic bones through a cartilage intermediate and generate a mar- 
row cavity populated by host-derived long-term reconstituting 
HSC (LT-HSC). In contrast, CD45~Tie2~ ay*CD105*Thy1* 
(CD105*Thy1*) fetal bone progenitors form bone that does not 
contain a marrow cavity. Suppressing expression of factors 
involved in endochondral ossification, such as osterix and vascular 
endothelial growth factor (VEGF), inhibited niche generation. 
CD105*Thyl~ progenitor populations derived from regions of 
the fetal mandible or calvaria that do not undergo endochondral 
ossification formed only bone without marrow in our assay. 
Collectively, our data implicate endochondral ossification, bone 
formation that proceeds through a cartilage intermediate, as a 
requirement for adult HSC niche formation. 

Growth and renewal in many tissues are initiated by stem cells, 
supported by the niches in which they reside’*. Although recent 
work has begun to describe functional interactions between stem cells 
and their niches, little is known about the formation of stem-cell 
niches. Identification of the cells and processes that can generate, 
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sustain and influence the HSC niche and haematopoiesis are critical 
for our understanding of normal haematopoiesis; stem-cell homing, 
trafficking and differentiation; and haematopoietic pathology*”’. 
There is a need for modular systems in which the cellular and 
molecular components of a niche can be genetically modified and 
studied in vivo. 

We have established an in vivo assay that allows functional assess- 
ment of the formation and maintenance of HSC niches at an ectopic 
site. We hypothesized that circulating HSC would colonize a non- 
haematopoietic location and establish functional haematopoiesis if 
appropriate niche components were present. We selected the subcap- 
sular site of the kidney because it possesses a rich vascular supply, 
supports several kinds of tissue engraftment and is not known to 
contain HSCs. In mice, HSCs are not detectable in the limb bone 
rudiment until 17.5 days post-coitum (d.p.c.)'’. In our initial experi- 
ments, we showed that transplantation of 14.5 d.p.c. fetal bones under 
the kidney capsule, into either green fluorescent protein (GFP) trans- 
genic or CD45 congenic hosts, resulted in the formation of donor- 
derived bones with host-derived marrow and HSCs (Supplementary 
Fig. 2). This result indicates that 14.5 d.p.c. fetal bones contain 
elements that can initiate an ectopic niche. To determine if the fetal 
bones must be intact for niche initiation, we dissociated the 14.5 d.p.c. 
fetal bones into a single cell suspension, embedded the suspension in 
Matrigel, and introduced it under the kidney capsule. The suspension 
generated both cartilaginous and membranous bones that were popu- 
lated with phenotypic and functional HSC (Fig. la-e and 


Figure 1| Fetal bone cells can initiate an ectopic 
HSC niche. a, Ectopic bone formed by GFP- 
labelled, 14.5 d.p.c. fetal bone cells 32 days after 
subrenal capsule transplant (scale bar, 500 |um). 
b, Representative section of graft stained with 
pentachrome (yellow, osteoid; greenish blue, 
cartilage), cartilaginous region in black circle 
(scale bar, 100 ttm). ¢, Safranin-O stain of 
adjacent section, red-staining cartilage matrix in 
yellow circle (scale bar, 200 1m). d, Alizarin Red 
stain for calcified tissue (scale bar, 500 um). 

e, Time course study of haematopoietic 
components during ectopic niche formation. 
Haematoxylin and eosin staining (upper panel; 


scale bar, 100 um), representative FACS profiles 
of LT-HSC (CD45 "lineage c- 
kit*Scal*CD150*) frequency that were pre- 
gated for live, CD45 "lineage cells (lower panel). 
Days after transplantation are indicated; n = 4 


for each time point. 
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Figure 2 | CD105* Thy1” population forms ectopic HSC niche through a 
cartilaginous intermediate. a, Representative FACS profile of homogenized 
15.5 d.p.c. fetal bones pre-gated for live CD45 Ter119 cells showing 
CD105*Thyl,CD105*Thyl* and CD105° Thy1~ populations (green, red 
and blue gates, respectively). b, c, Two thousand double-sorted fetal bone 
cells from each fraction were injected under the renal capsule and harvested 


Supplementary Figs la, b and 5). To distinguish donor from host 
tissue, we transplanted either fetal bones from GFP transgenic mice 
or wild-type fetal bone suspension transduced with lentiviral-GFP. 
The haematopoietic and vascular components within the ectopic 
niche were host derived; non-haematopoietic and non-vascular com- 
ponents, including bone and cartilage, were donor derived 
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16 days (n = 4) (b) or 32 days (n = 13) (c) after transplantation. Bright-field 
(upper panel) and GFP images (middle panel) of explanted ectopic grafts. 
Pentachrome staining of transverse sections through grafts at 16 and 32 days 
(lower panel). (Scale bar in upper and middle panels, 500 jim; in lower panel, 
100 um.) 


(Supplementary Figs 2-4). The engraftment and activity of host 
HSC within these ectopic niches has been verified by surface marker 
phenotype and functional long-term engraftment assays in secondary 
recipients (Supplementary Figs la, b and 5). We did not detect HSCs 
either in the ungrafted kidneys of transplanted mice or in kidneys 
transplanted with Matrigel only (Supplementary Fig. 1a, b). To deter- 
mine the kinetics of HSC colonization relative to ectopic bone forma- 
tion, we evaluated both the presence of LT-HSC and histological 
parameters of bone formation at 8-day intervals over a period of 32 
days. Donor-derived bone was present at day 16 post-transplant, 
coincident with the appearance of erythrocytes (Fig. le) and host- 
derived PECAM™ vasculature (Supplementary Figs 3 and 4). By day 
24, c-kit” progenitors appeared; however, host-derived HSC were not 
detected until day 32. The day 32 grafts were structurally similar to 
normal bones with regions of cartilaginous, compact and trabecular 
bone (Fig. 1b—e). The presence of HSCs was found to be stable after the 
ectopic niche was established (data not shown). 


Figure 3 | Niche formation is dependent on endochondral ossification. 
a-—c, Suppression of SLF expression in fetal bone cells did not alter 
osteogenesis or niche formation. Fetal bone cells (14.5 d.p.c.) were 
transduced with lentivirus as indicated, and 2,000 sorted GFP* cells were 
injected under the renal capsule. a, Paraffin-embedded sections were 
obtained 32 days after transplantation and stained with pentachrome. 

b, Representative FACS profiles of pre-gated, live CD45" lineage’ cells 
harvested from intact 15.5 d.p.c. normal (wild type) or mutant (SI/SI) fetal 
bones 40 days after transplantation (n = 4). c, Frequency of LT-HSC in 
different ectopic niches shown as mean values + s.e.m. (wild-type bone, 

n = 3; SI/SI bone n = 4; sh-Mock, n = 6; sh-SLF, n = 3). d, Knockdown 
efficiency of sh-SLF and sh-osterix was determined by quantitative reverse 
transcription PCR (qRT-PCR) in a 1A5 osteoblast cell line. Relative 
expression percentage is shown as mean values + s.d. (n = 3). e, Suppression 
of osterix expression disrupted osteogenesis and niche formation. Fetal bone 
cells from 13.5 d.p.c. (upper panel) or 14.5 d.p.c. (lower panel) were 
transduced with the lentivirus indicated and 2,000 sorted GFP* cells were 
transplanted (n = 4). f, VEGF is required for formation of bone marrow 
cavity. Adult C57B1/6 mice were intravenously injected with (multiplicity of 
infection = 10°) adenovirus constructs the expressing either mouse Fc (Ad- 
Fc) or the soluble ectodomain of VEGFR2 (Ad-solVEGFR2) (n = 3). Two 
days after virus injection, 13.5 d.p.c. fetal bone elements were transplanted 
under the renal capsule. Paraffin-embedded sections were obtained 25 days 
after transplantation and stained with pentachrome. (Scale bars in 
brightfield images, 500 jm; in pentachrome-stained images, 100 [um.) 
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To characterize the progenitors responsible for bone formation and 
their role in haematopoietic niche formation and maintenance fur- 
ther, we fractionated fetal-bone cell suspensions with a panel of cell- 
surface markers for: putative mesenchymal stem cells (CD105 and 
Thyl.1); the angiopoietin receptor that is on a variety of haemato- 
poietic and vascular cells (Tie2) (ref. 8); haematolymphoid cells 
(CD45); and a vascular integrin (%y integrin)'*. CD105 and Thy1.1 
are also expressed on haematopoietic and endothelial cells, but these 
cell types were negatively gated from the skeletal progenitors with 
CD45 and Tie2, respectively. By transplanting distinct donor fractions 
sorted by flow cytometry, we identified the minimal progenitor popu- 
lation required for the formation of a functional ectopic HSC niche. 
We found that CD105*Thyl_ progenitors consistently gave rise to 
bone with incorporated HSC-containing-marrow, whereas equal 
numbers of CD105*Thy1” progenitors resulted in bone formation 
without marrow (Fig. 2a-c and Supplementary Fig Ic). 
CD105 Thyl” populations did not form bones or niches efficiently. 
Thus far, we have not observed marrow formation without bone, 
which suggests that bone forming cells such as osteoblasts, osteoblast 
progenitors and/or osteoblast-associated cells may be at least indir- 
ectly or structurally important for niche formation. It is important to 
note, however, that neither osteoblasts nor osteoblast progenitors 
alone were sufficient to initiate niche formation and HSC engraftment 
as the CD105*Thyl* population generated marrowless bone. To 
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Figure 4 | Skeletal progenitors from mandible and calvaria do not form 
HSC niches efficiently. a, b, Representative FACS profile of homogenized 
15.5 d.p.c. mandible (n = 6; PI, propidium iodine) (a) or calvaria (n = 6) 
(b) pre-gated on live CD45 Terl19 cells. ¢, d, Two thousand sorted GFP* 
CD105*Thy1_. cells from mandible (c) or calvaria (d) were transplanted 
under renal capsule and harvested after 32 days. e, Pentachrome-stained 
cross section of mouse parietal bone at 4 weeks. f, Pentachrome-stained cross 
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differentiate between the CD105* Thy1” bone niche-generating and 
the CD105* Thy1”* bone-generating populations, we conducted time- 
course studies to compare the respective mechanisms of bone forma- 
tion. We transplanted equal numbers of each population and har- 
vested the grafted kidneys at 16 and 32 days post-transplantation 
for histological characterization. We found that only 
CD105*Thyl~ progenitors formed bones through a cartilage inter- 
mediate, also known as endochondral ossification, whereas 
CD105*Thy1* progenitors formed bones without a detectable cartil- 
age intermediate (Fig. 2b, c). Furthermore, we found that expression 
of osteocalcin, a marker of mature osteoblasts’’, was fivefold higher in 
CD105*Thy1* populations derived from 15.5 d.p.c. fetal bones 
(Supplementary Fig. 1d, e). These results suggest that 
CD105*Thyl* progenitors may have lost chondrocyte potential’®. 
To assess the importance of specific candidate factors for niche 
development and HSC maintenance, we suppressed the expression of 
steel factor (SLE), osterix and VEGF. SLF is essential for adult hae- 
matopoiesis and HSC activity’’, and is expressed both by mature 
osteoblasts’* and endothelial cells'’. We inhibited expression of SLF 
by transducing fetal bone suspensions with a GFP-labelled, SLF- 
specific short hairpin RNA (shRNA) lentiviral vector before renal 
capsule transplant (Fig. 3d). We observed normal osteogenesis 
and niche formation when SLF expression was inhibited (Fig. 3a, 
c). HSC engraftment was similar between mock-transduced and 


section of equivalent area in 15 d.p.c. fetal calvaria. g—i, Marrow pockets in 
calvaria are concentrated in facial areas corresponding to cartilaginous 
regions (n = 3). Limb bones are juxtaposed to skull for comparison. h, Dorsal 
and lateral views of newborn calvaria show cartilaginous regions that stain 
with alcian blue. (Scale bar in brightfield and GFP images, 500 um; in 
pentachrome images, 100 tum, in (f), 25 jum.) 
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SLF-deficient grafts, consistent with our previous observation that 
SLF production is not essential for haematopoiesis during the fetal 
period”®”'. To confirm the results of the knockdown, we transplanted 
intact 14.5 d.p.c. fetal bones from SLF null mutant (SI/SI) and again 
did not observe defects in either osteogenesis or niche forma- 
tion (Fig. 3b, c). This indicates that the SLF produced by skeletal 
progenitors and mature bone tissue is not required for initiation or 
maintenance of niche activity. We next silenced expression of osterix 
(Fig. 3e), a transcription factor necessary for endochondral ossifica- 
tion, in fetal bone suspensions using lentivirus vector for osterix- 
specific shRNA’'”?. The osterix knockdown severely inhibited 
osteogenesis and abolished niche formation, underscoring the 
dependence of niche formation on the process of endochondral ossi- 
fication. Because perichondrial cells and chondrocytes in the devel- 
oping limb express high levels of VEGF, and vascular invasion is 
critical to endochondral ossification”, we tested whether VEGF 
activity was required for niche formation. We suppressed endogen- 
ous VEGF by injecting the host mice with adenovirus expressing 
soluble VEGF receptor (soluble Flk1; Ad-solVEGFR2), a known 
VEGF-inhibiting reagent™*. We found that endochondral ossification 
was disrupted when 13.5 d.p.c. fetal bones were transplanted into 
hosts treated with Ad-solVEGFR2 but not into hosts treated with 
control virus (Ad-Fc). The 13.5 d.p.c. fetal bones grafted into Ad- 
solVEGFR2 mice displayed an accumulation of chondrocytes with- 
out perfusing vasculature and marrow cavity, although total HSC 
numbers in the host were not significantly affected (Fig. 3f)**. To test 
further the hypothesis that niche formation requires endochondral 
ossification, we isolated CD105* Thyl” progenitors from the regions 
of fetal mandible and calvaria that form bone primarily through 
intramembranous ossification’, which occurs without a cartilagin- 
ous intermediate. In these experiments CD105*Thyl mandibular 
and calvarial progenitors could only form marrowless bones, even 60 
days after transplantation (Fig. 4). These results suggest that endo- 
chondral ossification is necessary for niche formation. 

A better functional understanding of the HSC niche was gained by 
observing how it is formed. In this study, CD105*Thyl skeletal 
progenitors isolated from fetal limb bones initiated ectopic HSC 
niche formation. The progenitors gave rise to donor-derived chon- 
drocytes, which recruited host-derived vasculature into the centre of 
the developing bone graft. As endochondral ossification proceeded, 
the recruited vasculature facilitated the filling of the niche with host- 
derived haematopoietic cells: first erythroid and myeloid, then c-kit* 
progenitors, and finally the HSCs. We do not yet know if the 
CD105*Thy1~ cells represent a homogeneous population, or phe- 
notypically similar but heterogeneous subsets. In agreement with our 
findings, Sacchetti et al. recently identified CD146* subendothelial 
cells residing in adult human bone marrow stroma that can generate 
both bone and marrow when transplanted under the skin of immu- 
nodeficient mice’. Although their study did not verify the presence 
of LT-HSC in the subdermal grafts, it is possible that osteoprogeni- 
tors in the adult marrow are involved in maintaining the niche. In 
addition to identifying the fetal-bone-derived skeletal progenitors 
that are capable of both endochondral ossification and HSC niche 
formation, our study provides a functional framework by which, in 
combination with previously described methods®’, HSC-niche 
interactions can be further investigated at the cellular level. 


METHODS SUMMARY 


C57/BL6 CD45.1/2 congenic mouse strains were derived and maintained in our 
laboratory. Timed embryos from GFP transgenic HZ mice were used in most of 
the fetal bone transplantation studies. Sl/+ mice were purchased from the 
Jackson laboratory. 

Skeletal progenitors were isolated from fetal bones (humerus, radius, tibia, 
femur, pelvis, mandible without the condyle, and the individual frontal and parietal 
bones by collagenase digestion). Next, they were stained with antibodies against 
CD45, Tie2, uy integrin, CD105 and Thyl.1 for fractionation by fluorescence- 
activated cell sorting (FACS). Sorted and unsorted skeletal progenitors were then 
injected underneath the renal capsule of 8- to 12-week-old anaesthetized mice. 


LETTERS 


SLF and osterix-specific shRNA knockdown constructs, active lentiviral stock 
and non-silencing shRNA constructs were generated as previously described”* 
(Supplementary Table 1). Fetal bone cell suspensions were transduced for 48h 
with specific shRNA vectors or control, sorted for GFP expression and trans- 
planted as described. 

To assess HSC engraftment in ectopic niches, grafted regions were dissected 
from kidney and crushed by mortar and pestle. Dissociated cells were stained 
with fluorochrome-conjugated antibodies against CD45, lineage (CD3, CD4, 
CD5, CD8, B220, Gr-1, Mac-1 and Ter119), c-kit, Sca-1 and CD150 for FACS 
analysis. Sorted KLS, CD150* LT-HSC were transplanted into lethally irradiated 
(800 rad delivered in split dose) 8- to 12-week-old congenic recipients by intra- 
venous injection for functional analysis. Peripheral blood was obtained from the 
tail vein at 4 and 23 weeks after LT-HSC transplantation to assess donor-derived 
contributions by FACS. 

Histological analyses of endochondral ossification were performed on sec- 
tions that were obtained from either fresh frozen, optimal cutting temperature 
media (OCT)-embedded or formaldehyde-fixed, paraffin-embedded specimens. 
Representative sections were stained with either haematoxylin and eosin, 
Movat’s modified pentachrome”’, Safranin-O or Alizarin Red stains, depending 
on the experiments. 

RNA was extracted from sorted cells using Trizol (Invitrogen) or RNeasy RNA 
isolation kits (Qiagen) and was reverse-transcribed into complementary DNA 
(cDNA) with SuperscriptRT II (Invitrogen). SYBR Green Universal Master Mix 
and a GeneAmp 7000 or 7500 fast sequence detection system (Applied 
Biosystems) were used for real-time PCR with the primers listed in 
Supplementary Table 2. Relative expression was calculated for each gene by 
the comparative CT (27 44Cr) method with f-actin for normalization. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. C57BL/Ka-Thy1.1-CD45.1 (HZ), C57BL/Ka-Thy1.1-CD45.1 (BA), C57BL/ 
Ka-Thy1.2-CD45.1 (Ly5.2) and C57BL/Ka-Thyl.2-CD45.1 (B6) strains were 
derived and maintained in our laboratory. Timed embryos from GFP transgenic 
HZ mice were used in most of the fetal bone transplantation studies. shRNA studies 
used fetal bones from BA or HZ embryos. Eight- to twelve-week-old 
RAG2 ‘~yc ‘~ mice in B6 background were used as recipients for fetal bone 
transplantation. B6 or Ly5.2 mice were used as recipients in LT-HSC transplanta- 
tion assays. SI/+ mice were purchased from the Jackson laboratory. SI/Sl embryos 
were screened by genomic PCR before transplantation. All animals were maintained 
in Stanford University Laboratory Animal Facility in accordance with Stanford 
Animal Care and Use Committee and National Institutes of Health guidelines. 
Isolation and transplantation of fetal skeletal progenitors. Fetal skeletal ele- 
ments (humerus, radius, tibia, femur, pelvis, mandible without the condyle, and 
the individual frontal and parietal bones) were dissected from euthanized mouse 
fetuses and digested in collagenase with DNase at 37 °C for 40 min under con- 
stant agitation. After collagenase treatment, undigested materials were gently 
triturated by repeated pipetting. Total dissociated cells were filtered through 
40-u1m nylon mesh, pelleted at 200g at 4°C, resuspended in staining media 
(2% fetal calf serum in PBS), blocked with rat IgG and stained with fluoro- 
chrome-conjugated antibodies against CD45, Tie2, ay integrin, CD105 and 
Thy1.1 for purification by flow cytometry sorting. Sorted and unsorted skeletal 
progenitors were pelleted and resuspended in 2 ull of Matrigel, then injected 
underneath the renal capsule of 8- to 12-week-old anaesthetized mice. 

shRNA transduction. SLF and osterix-specific shRNA knockdown constructs 
and active lentiviral stocks were generated as previously described 
(Supplementary Table 1). Fetal bone cells were resuspended in 2MEM medium 
with 15% FCS and transduced with lentiviral vectors carrying SLF or osterix- 
targeting shRNA, non-silencing shRNA or empty vector. Forty-eight hours after 
transduction, cells were sorted for GFP expression and transplanted as described. 
Suppression of SLF or osterix by shRNA was verified by qRT-PCR in an osteo- 
blastic cell line, 1A5, that expresses SLF and osterix (a gift from J.-B.K.). 
Analysis of HSC engraftment in ectopic niches. Kidneys were removed from 
host mice, kept on ice and imaged. The grafted regions were dissected from 
the surface of the host kidney and then crushed between frosted, precleaned 
micro-slides (VWR). Homogenized tissues were filtered through 40-j1m nylon 


nature 


mesh. Mononuclear cells were isolated using Histopaque 1107 according to the 
manufacturer’s protocol. Cells were washed with staining media, then blocked 
with rat IgG and stained with fluorochrome-conjugated antibodies purchased 
from eBiosciences against CD45, lineage (CD3, CD4, CD5, CD8, B220, Gr-1, 
Mac-1 and Terl19), c-kit, Scal and CD150 for flow cytometry analysis. 
LT-HSC functional assay. Sorted LT-HSC (lineage-, c-kit’, Sca-1+, CD150*, 
CD45*) and 3 X 10° helper marrow cells were transplanted into lethally irra- 
diated (800 rad delivered in split dose) 8- to 12-week-old congenic recipients by 
injection into the retro-orbital sinus. Peripheral blood was obtained from the tail 
vein at 4 and 23 weeks after LT-HSC transplantation to assess donor-derived 
contributions by flow cytometry. 

Histological analysis of endochondral ossification. Dissected specimens were 
fixed in 2% PFA at 4°C overnight, then decalcified in 0.4 M EDTA in PBS (pH 
7.2) at 4°C for 2 weeks. Specimens were then processed for embedding in 
paraffin (by dehydration in alcohol and xylene) or OCT (by cryoprotection in 
sucrose) and sectioned. Representative sections were stained with either haema- 
toxylin and eosin, Movat’s modified pentachrome”’, Safranin-O or Alizarin Red 
stains, depending on the experiments. 

Immunofluorescent histology. Immunofluorescence on cryopreserved ectopic 
bone specimens was performed using an M.O.M. immunodetection kit from 
Vector Laboratories according to the manufacturer’s instructions. Briefly, speci- 
mens were treated with a blocking reagent, than probed with monoclonal anti- 
body at 4°C overnight. Specimens were next washed with PBS, probed with 
alexa-dye-conjugated antibodies, washed, coverslipped and imaged with a 
Leica DMI6000B inverted microscope system. Rat anti-PECAM(CD31) mono- 
clonal antibody was purchased from Abcam. Alexa-dye-conjugated Goat anti- 
Rat secondary antibodies were purchased from Molecular Probes. 

RNA extraction and qRT-PCR. RNA was extracted from sorted cells using 
Trizol (Invitrogen) or RNeasy RNA isolation kits (Qiagen) and was reverse- 
transcribed into cDNA with SuperscriptRT III (Invitrogen). SYBR Green 
Universal Master Mix and a GeneAmp 7000 or 7500 fast sequence detection 
system (Applied Biosystems) were used for real-time PCR with the primers listed 
in Supplementary Table 2. Relative expression was calculated for each gene by the 
2-AACT method with B-actin for normalization. 


29. Garvey, W. et al. Improved Movat pentachrome stain. Stain Technol. 61, 60-62 
(1986). 
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Pulsed contractions of an actin-myosin network drive 


apical constriction 


Adam C. Martin’?, Matthias Kaschube°* & Eric F. Wieschaus!” 


Apical constriction facilitates epithelial sheet bending and invag- 
ination during morphogenesis’’. Apical constriction is conven- 
tionally thought to be driven by the continuous purse-string-like 
contraction of a circumferential actin and non-muscle myosin-II 
(myosin) belt underlying adherens junctions*’. However, it is 
unclear whether other force-generating mechanisms can drive this 
process. Here we show, with the use of real-time imaging and 
quantitative image analysis of Drosophila gastrulation, that the 
apical constriction of ventral furrow cells is pulsed. Repeated con- 
strictions, which are asynchronous between neighbouring cells, 
are interrupted by pauses in which the constricted state of the cell 
apex is maintained. In contrast to the purse-string model, con- 
striction pulses are powered by actin—myosin network contrac- 
tions that occur at the medial apical cortex and pull discrete 
adherens junction sites inwards. The transcription factors Twist 
and Snail differentially regulate pulsed constriction. Expression of 
snail initiates actin-myosin network contractions, whereas 
expression of twist stabilizes the constricted state of the cell apex. 
Our results suggest a new model for apical constriction in which a 
cortical actin—-myosin cytoskeleton functions as a developmentally 
controlled subcellular ratchet to reduce apical area incrementally. 

During Drosophila gastrulation, apical constriction of ventral cells 
facilitates the formation ofa ventral furrow and the subsequent inter- 
nalization of the presumptive mesoderm. Although myosin is known 
to localize to the apical cortex of constricting ventral furrow cells*"’, 
it is not known how myosin produces force to drive constriction. 
Understanding this mechanism requires a quantitative analysis of cell 
and cytoskeletal dynamics. We therefore developed methods to 
reveal and quantify apical cell shape with Spider—GFP, a green fluor- 
escent protein (GFP)-tagged membrane-associated protein that out- 
lines individual cells (Fig. la, b, Supplementary Fig. 1 and 
Supplementary Video 1)’*. Ventral cells were constricted to about 
50% of their initial apical area before the onset of invagination and 
continued to constrict during invagination (Fig. lc, e). Although the 
average apical area steadily decreased at a rate of about 5 um? min |, 
individual cells showed transient pulses of rapid constriction that 
exceeded 10-15 tm’ min ' (Fig. 1d, f, g, and Supplementary Video 
2). During the initial 2 min of constriction, weak constriction pulses 
were often interrupted by periods of cell stretching. However, at 
2 min, constriction pulses increased in magnitude and cell shape 
seemed to be stabilized between pulses, leading to net constriction 
(Fig. 1d). These two phases probably correspond to the ‘slow/apical 
flattening’ and ‘fast/stochastic’ phases that have been described pre- 
viously'*"*. Overall, cells underwent an average of 3.2 + 1.2 constric- 
tion pulses over 6min, with an average interval of 82.8 + 48s 
between pulses (mean + s.d., n = 40 cells, 126 pulses). Constriction 
pulses were mostly asynchronous between adjacent cells (Fig. 1h 
and Supplementary Video 3). As a consequence, cell apices between 


constrictions seemed to be pulled by their constricting neighbours. 
Thus, apical constriction occurs by means of pulses of rapid constric- 
tion interrupted by pauses during which cells must stabilize their 
constricted state before reinitiating constriction. 

To determine how myosin might generate force during pulsed 
constrictions, we simultaneously imaged myosin and cell dynamics 
by using myosin regulatory light chain (spaghetti squash, or squ) 
fused to mCherry (Myosin—mCherry) and Spider-GFP. Discrete 
myosin spots and fibres present on the apical cortex formed a net- 
work that extended across the tissue (Fig. 2a and Supplementary Fig. 
2a). These myosin structures were dynamic, with apical myosin spots 
repeatedly increasing in intensity and moving together (at about 
40nms~') to form larger and more intense myosin structures at 
the medial apical cortex (Fig. 2c, Supplementary Fig. 2b, c, and 
Supplementary Video 4). This process, which we refer to as myosin 
coalescence, resulted in bursts of myosin accumulation that were 
correlated with constriction pulses (Fig. 2b-e and Supplementary 
Video 5). The peak rate of myosin coalescence preceded the peak 
constriction rate by 5-10 s, suggesting that myosin coalescence causes 
apical constriction (Fig. 2e). Between myosin coalescence events, 
myosin structures, including fibres, remained present on the cortex, 
possibly maintaining cortical tension between constriction pulses 
(Fig. 2c). Contrary to the purse-string model, we did not observe 
significant myosin accumulation at cell-cell junctions. To confirm 
that constriction involved medial myosin coalescence and not con- 
traction of a circumferential purse-string, we correlated constriction 
rate with myosin intensity at either the medial or junctional regions 
of the cell. Apical constriction was correlated more significantly with 
medial myosin (Fig. 2f), suggesting that, in contrast to the purse- 
string model, constriction is driven by contractions at the medial 
apical cortex. 

Myosin coalescence resembled contraction of a cortical actin— 
myosin network'*'®. Therefore, to determine whether apical constric- 
tion is driven by pulsed contractions of the actin—myosin network, we 
examined the organization of the cortical actin cytoskeleton. In fibro- 
blasts and keratocytes, actin network contraction bundles actin fila- 
ments into fibre-like structures’®’’. Consistent with this expectation 
was our identification of an actin filament meshwork underlying the 
apical cortex in which prominent actin—myosin fibres spanning the 
apical cortex appeared specifically in constricting cells (Fig. 3a and 
Supplementary Fig. 3a). An actin—myosin network contraction model 
would predict that myosin coalescence results from myosin spots 
exerting traction on each other through the cortical actin network. 
To test whether myosin coalescence requires an intact actin network, 
we disrupted the actin network with cytochalasin D (CytoD). 
Disruption of the actin network with CytoD resulted in apical myosin 
spots that localized together with actin structures and appeared spe- 
cifically in ventral cells (Supplementary Fig. 3b, c). Myosin spots in 
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Figure 1| Apical constriction of ventral furrow cells is pulsed. a, Diagram 
of the imaging approach used to show apical constriction of the ventral 
furrow cells. We selected tangential Z-slices 2 |1m below the apical surface 
(red slices) to show cell outlines. b, Z-slices (top) and YZ cross-sections 
(bottom) of cell membranes revealed with Spider—GFP. Scale bar, 10 tum. 
c, d, Apical areas (c) and constriction rates (d) for individual cells of a 
representative embryo. Each row represents data (see colour bars) for an 
individual cell. e, Mean apical area (red) and furrow depth (black). Dotted 
line indicates when tissue invagination initiates. Error bars indicate s.d. 
(n= 41 cells). f, g, Quantification (f) and time-lapse images (g) of the 
constriction of an individual cell. The red arrows (c, d) and red dots (g) mark 
the cell that is quantified in f. C, contraction. S, stabilization. Scale bar, 4 um. 
(h) Pulsed constriction is asynchronous in neighbouring cells. Constriction 
rate is colour-coded (see colour bar) and mapped onto the corresponding 
cells in images at different time points. 


CytoD-injected embryos showed more rapid movement than those in 
control-injected embryos, suggesting that apical myosin spots in 
untreated embryos are constrained by the cortical actin network 
(Supplementary Fig. 3d). Although myosin movement was uninhi- 
bited in CytoD-treated embryos, myosin spots failed to coalesce and 
cells failed to constrict (Fig. 3b and Supplementary Fig. 3e). Because 
myosin coalescence requires an intact actin network, we propose 
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that pulses of myosin coalescence represent contractions of the 
actin—myosin network. 

Because actin—myosin contractions occurred at the medial apical 
cortex, it was unclear how the actin—myosin network was coupled to 
adherens junctions. We therefore imaged E-Cadherin—GFP and 
Myosin—mCherry to examine the relationship between myosin and 
adherens junctions. Before apical constriction, adherens junctions 
are present about 4 {tm below the apical cortex'®. As apical constric- 
tion initiated, these subapical adherens junctions gradually disap- 
peared and adherens junctions simultaneously appeared apically at 
the same level as myosin*'’. This apical redistribution of adherens 
junctions occurred at specific sites along cell edges (midway between 
vertices; Supplementary Fig. 3f). As apical constriction initiated, 
these sites bent inwards. This bending depended on the presence of 
an intact actin network, which is consistent with contraction of the 
actin—myosin network generating force to pull junctions (Supple- 
mentary Fig. 3f). Indeed, myosin spots undergoing coalescence were 
observed to lead adherens junctions as they transiently bent inwards 
(Fig. 3c). Thus, pulsed contraction of the actin—myosin network at 
the medial cortex seems to pull the cell surface inwards at discrete 
adherens junction sites, resulting in apical constriction. 

The transcription factors Twist and Snail regulate the apical con- 
striction of ventral furrow cells*”*. Snail is a transcriptional repres- 
sor whose target or targets are currently unknown, whereas Twist 
enhances snail expression and activates the expression of fog and 148, 
which are thought to activate the Rhol GTPase and promote myosin 
contractility*'°'°*'**. To examine the mechanism of pulsed apical 
constriction further, we tested how Twist and Snail regulate myosin 
dynamics. In contrast to wild-type ventral cells, in which myosin was 
concentrated on the apical cortex (Fig. 2a), twist and snail mutants 
accumulated myosin predominantly at cell junctions, similarly to 
lateral cells (Fig. 4a). These ventral cells failed to constrict produc- 
tively, which supported our cortical actin-myosin network contrac- 
tion model, rather than the purse-string model, for apical 
constriction. twistand snail mutants differentially affected the coales- 
cence of the minimal myosin that did localize to the apical cortex. 
Although myosin coalescence was inhibited in snail mutants, it still 
occurred in twist mutants, as did pulsed constrictions (Fig. 4a and 
Supplementary Video 6). This difference was also observed when 
Snail or Twist activity was knocked down by RNA-mediated inter- 
ference (referred to as snailIRNAi or twistRNAi) (Supplementary Fig. 
4a and Supplementary Video 7). However, the magnitude of con- 
striction pulses in twistRNAi embryos was greater than that of twist 
mutant embryos, suggesting that the low level of Twist activity pre- 
sent in twistRNAi embryos enhances contraction efficiency by activ- 
ating the expression of snail or other transcriptional targets. Myosin 
coalescence was inhibited in snail twist double mutants, demonstrat- 
ing that the pulsed constrictions in twist mutants required snail 
expression (Fig. 4a and Supplementary Video 6). Thus, the express- 
ion of snail, not twist, initiates the actin-myosin network contrac- 
tions that power constriction pulses. 

Net apical constriction was inhibited in both snailRNAi and 
twistRNAi embryos (Supplementary Fig. 4b). We therefore wondered 
why the pulsed contractions that we observed in twistRNAi embryos 
failed to constrict cells. Using Spider-GFP to visualize cell outlines, we 
found that although constriction pulses were inhibited in snailRNAi 
embryos, constriction pulses still occurred in twistRNAi embryos 
(Fig. 4b, c, Supplementary Fig. 4c and Supplementary Video 8). 
However, the constricted state of cells in twistRNAi embryos was 
not stabilized between pulses, resulting in fluctuations in apical area 
with little net constriction (Fig. 4b, c). This stabilization defect was not 
due to lower snail activity, because these fluctuations continued when 
snail expression was driven independently of twist by using the P[sna] 
transgene (Fig. 4b). Although the frequency and magnitude of con- 
striction pulses in such embryos were similar to those in control 
embryos, stretching events were significantly higher in twistRNAj; 
P[sna] embryos, suggesting a defect in maintaining cortical tension 
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Figure 2 | Constriction pulses are correlated with myosin coalescence. 

a, Merged images of Myosin—-mCherry (Z-projection, 5 jum depth, green) 
and Spider—GFP (individual Z-slice 2 um below the apical cortex, red). YZ 
cross-sections at lower magnification to illustrate furrow progression are 
shown at the bottom. b, Mean apical area and myosin intensity (left) and 
myosin intensity for individual cells (right) for a representative embryo. 
Error bars indicate s.d. (n = 37 cells). ¢, Single channel and merged time- 
lapse images of Myosin—mCherry (green) and Spider—GFP (red). Red arrows 
indicate spots that will coalesce. Blue arrow indicates myosin fibre that 
appears between contractions. d, Plots of apical area and myosin intensity 
against time (top) and constriction rate and rate of change of myosin 
intensity against time (bottom) for an individual cell. e, Plot of correlation 
between constriction rate and myosin intensity rate of change for individual 
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Figure 3 | Pulsed myosin coalescence and adherens junction bending 
require an actin-myosin network. a, Cortical myosin (green), cortical 
F-actin (red), and F-actin 2 1m below the apical cortex (white, to illustrate 
cell shape) were revealed in fixed embryos. b, Time-lapse images of 
Myosin-GFP in control-injected (DMSO) and CytoD-injected embryos. 
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cells (top) and averaged (n = 37 cells, bottom) against time offset. 
Correlation coefficients were calculated for various time offsets by 
temporally shifting the data sets relative to each other. Dotted lines indicate 
zero offset. Note that the maximum correlation occurs when myosin rate is 
shifted 5-10 s later in time; myosin coalescence therefore slightly precedes 
constriction rate. f, Constriction rate is more highly correlated with medial 
myosin than with junctional myosin. The diagram (left) illustrates the 
purse-string model for contraction in which we expect actin and myosin to 
become concentrated in the junctional region on constriction. Data points 
(right) represent correlation coefficients for individual cells, and the black 
bar indicates the mean (n = 37 cells). Asterisk, the difference between the 
means is statistically significant (P < 0.0001). Scale bars, 4 1m. 
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Arrows indicate individual myosin spots. Note that myosin spots move, but 
do not coalesce, in CytoD-treated embryos. c, Single-channel and merged 

time-lapse images of Myosin—mCherry (green) and E-Cadherin—GFP (red). 
Red arrows indicate myosin coalescence. Blue arrows indicate the site where 
adherens junctions bend inwards beneath a myosin spot. Scale bars, 4 jum. 
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Figure 4 | Snail and Twist function at distinct phases of pulsed 
constriction. a, Time-lapse images of Myosin—GFP Z-projections. Blue 
arrows indicate myosin spots that do not efficiently coalesce in snail 
mutants. Red arrows indicate myosin coalescence in twist mutants that 
seems to pull cell junctions. At least one coalescence event that pulled cell 
junctions occurred over a 6-min period for 53% of cells in the twist mutant, 
in contrast with 4% of cells in snail and snail twist mutants (n = 60 cells, 
three embryos per mutant). Scale bar, 4 1m. b, Time-lapse images of 
Spider-—GFP in snailRNAi or twistRNAi embryos. P{sna] indicates twist- 
independent snail expression. Scale bar, 4 1m. ¢, Quantification of apical 
area (red) and constriction rate (blue) for individual cells in snaiIRNAi (left) 
and twistRNAi (right) embryos. d, Ratchet model of apical constriction. 
Myosin (green) contracts an apical actin network (red) that is coupled to 
adherens junctions (blue) driving constriction. Contractions are pulsed, 
interrupted by a phase in which the constricted state of the cell is stabilized. 


(Supplementary Fig. 4d). This defect might result from a failure to 
establish a dense actin meshwork, because both twist mutants and 
twistRNAi embryos had a more loosely arranged apical meshwork 
of actin spots and fibres than constricting wild-type cells did 
(Supplementary Fig. 4e). twist expression therefore stabilizes the con- 
stricted state of cells between pulsed contractions. 
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Thus, we propose a ‘ratchet’ model for apical constriction, in 
which phases of actin—myosin network contraction and stabilization 
are repeated to constrict the cell apex incrementally (Fig. 4d). In 
contrast to the purse-string model, we find that apical constriction 
is correlated with pulses of actin—myosin network contraction that 
occur on the apical cortex. Pulsed cortical contractions could allow 
dynamic rearrangements of the actin network to optimize force gen- 
eration as cells change shape. Because contractions are asynchronous, 
cells must resist pulling forces from adjacent cells between contrac- 
tions. A cortical actin-myosin meshwork seems to provide the cor- 
tical tension necessary to stabilize apical cell shape and promote net 
constriction. The transcription factors Snail and Twist are critical for 
the contraction and stabilization phases of constriction, respectively. 
Thus, Snail and Twist activities are temporally coordinated to drive 
productive apical constriction. Despite the dynamic nature of the 
contractions in individual cells, the behaviour of the system at the 
tissue level is continuous, in a similar manner to convergent exten- 
sion in Xenopus”. Pulsed contraction may therefore represent a con- 
served cellular mechanism that drives precise tissue-level behaviour. 


METHODS SUMMARY 

Image acquisition and analysis. Two-colour imaging was performed at 22— 
25°C with a Leica SP5 confocal microscope, a 63X/1.3 numerical aperture 
glycerine-immersion objective, an argon ion laser and a 561-nm diode laser. 
Spider-GFP images represent confocal slices 21m below the apical cortex, 
whereas myosin images represent maximum-intensity Z-projections of an apical 
section 5 um in depth. Image stacks for Spider-GFP movies were acquired every 
6s, and image stacks for two-colour movies were acquired every 5s. Using 
MATLAB (MathWorks) we developed methods to track cells and measure apical 
area and myosin intensity. Data points were smoothed with a Gaussian smooth- 
ing filter with o = 15-18 s (three time points). Myosin intensity was measured 
from maximum-intensity Z-projections as the sum intensity of all pixels in a cell. 
Mean myosin intensity was calculated for junctional and medial pools of myosin 
by creating masks that selected regions less than 0.3 4m or more than 0.3 um 
from the cell boundary, respectively. 

Embryo fixation and staining. Heat fixation and staining with anti-myosin 
heavy chain (MHC) antibody did not preserve the normal organization of apical 
myosin observed in live squ-GFP°*, squ-mCherry (Myosin—mCherry), and GFP- 
zipper (GFP-MHC)” embryos. We therefore used endogenous GFP fluorescence 
to reveal myosin. squ-GFP embryos were fixed with 10% paraformaldehyde/ 
heptane for 20 min, devitellinized manually, stained with Alexa-568 phalloidin 
(Invitrogen) to reveal actin, and mounted in AquaPolymount (Poysciences, 
Inc.). 

Drug/RNAi injection. CytoD was injected laterally at mid-late cellularization 
with 0.5mgml~' CytoD in 10% dimethylsulphoxide (DMSO; Calbiochem). 
Double-stranded RNAs against snail and twist (2mgml ') were injected lat- 
erally into freshly laid eggs that were incubated 2.5—3.0 h before gastrulation was 
imaged. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 19 June; accepted 7 October 2008. 
Published online 23 November 2008. 
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METHODS 

Fly stocks and genetics. Fluorescent fusion protein stocks Spider—GFP (III) (95- 
1)'?, Myosin-GFP (II or III) (squ*?, squ-GFP, a gift from R. Karess)**, 
E-Cadherin—GFP (II) (ubi-DE-cad—GFP)"*, and Moesin—GFP (III)** are described 
in the indicated references. Myosin-mCherry (III) (squ—mCherry*"") was recom- 
bined with Spider-GFP or Moesin—GFP to generate Myosin—mCherry Spider— 
GFP/TM3 and Myosin—mCherry Moesin—GFP. Homozygous Myosin—mCherry 
Spider—GFP flies could not be maintained as a stock. Therefore, Myosin—mCherry 
Spider-GFP/Myosin—mCherry flies were used for two-colour imaging. The 
dynamics of Myosin—mCherry and Spider—GFP observed in the two-colour strain 
are indistinguishable from the dynamics of Myosin—GFP and Spider—GFP alone. 
This suggests that the behaviour of myosin we observe with two-colour imaging 
reflects normal myosin dynamics. E-Cadherin—GFP Myosin—mCherry flies used 
for imaging were ubi-DE-cad—GFP shg*?/CyO; squ-mCherry". 

Strategies were used to reveal both homozygous and hemizygous twist and 
snail mutants. First, the squ-GFP transgene was jumped onto the CyO balancer, 
generating CyO-Myosin—GFP. Chromosomes containing sna"©”, twi?°? and 
sna! ti”? were marked with Df(2L)dpp[s7-dp35] 21F1—3;22F 1-2 (halo) to 
allow homozygous embryos to be distinguished from their heterozygous sib- 
lings. We then rebalanced twist and snail mutants to obtain halo twi®”’/CyO- 
Myosin-GFP, halo sna''°/CyO-Myosin-GFP, and halo sna!!@ twi”??/CyO- 
Myosin-GFP. Homozygous mutants were selected by identifying the halo 
mutant phenotype. Myosin accumulation in ventral cells was delayed in twist 
and snail mutants, occurring after cephalic furrow initiation, in contrast to wild- 
type embryos, in which myosin appears before the cephalic furrow is observed. 
The results presented are representative from nine halo sna'© movies, six halo 
twi?®? movies and three halo sna'"°” twi'??? movies. 

Alternatively, a compound second chromosome stock homozygous for 
Myosin-GFP (III) (C(2)v; Myosin—GFP) was used. In the C(2)v stock, the right 
arms (containing twist) and the left arms (containing halo and snail) of chro- 
mosome 2 assort independently. For snail mutants, C(2)v; Myosin—-GFP was 
crossed to halo sna"°/CyO and halo embryos were identified. For twist 
mutants, C(2)v; Myosin—GFP was crossed to halo twi”°? /CyO and one-third 
of the non-halo progeny were twist mutants. Both strategies identified the dis- 
tinct phenotypes for snail and twist mutants described in Fig. 4a. 

To provide twist-independent snail expression we used the P[sna,] (referred to 
as P[sna]) transgene, which contains the snail cDNA downstream of two copies 
of the proximal element of the twist promoter”. One zygotic copy of P[sna,] was 
sufficient to induce invagination in a snail mutant background, indicating that it 
provides enough activity to rescue snail-dependent contraction. 

Construction of Myosin-mCherry. Myosin regulatory light chain (spaghetti 
squash, or squ), including its native promoter, was tagged at the carboxy ter- 
minus with mCherry. A 2-kilobase genomic fragment containing the squ open 
reading frame (ORF) and promoter was inserted into the KpnI and Sall sites of 
pBluescript. The squ 3’ untranslated region (800 base pairs) was then cloned into 
the downstream BamHI and Xbal sites. The sequence for mCherry, including a 
short linker region, was cloned into the Clal and BamHI sites in between the squ 
ORE and 3’ untranslated region. The 3.5-kilobase Kpnl/Xbal fragment contain- 
ing Myosin—mCherry was then cloned into pCasPer4 and sent to BestGene Inc. 
to make transgenic flies. Myosin-mCherry complemented the null squ‘*’ allele, 
demonstrating that it is functional. 

Time-lapse image acquisition. Egg collections were performed in plastic cups 
covered with apple-juice plates. Flies were allowed to lay eggs for 2-4 h at 25 °C 
before the plate was removed and embryos undergoing cellularization were 
collected. Embryos were dechorionated with 50% bleach, washed with water, 
and then mounted on a slide with embryo glue (Scotch tape resuspended in 
heptane), with the ventral side facing upwards. A chamber was made with two 
no. 1.5 coverslips as spacers and was filled with Halocarbon 27 oil for imaging. 
Embryos were not compressed. Mesoderm invagination occurred with a time- 
frame similar to that deduced from fixed embryos, and embryos imaged under 
these conditions could subsequenty hatch, demonstrating that our imaging con- 
ditions had minimal impact on development. 

Single-colour images of Spider-GFP and two-colour images of Myosin— 
mCherry Spider-GFP were obtained with a Leica SP5 confocal microscope, a 
63X/1.3 numerical aperture glycerine-immersion objective, an argon ion laser 
and a 561-nm diode laser. Images were acquired with a pinhole setting of two 
Airy units. For simultaneous two-colour images, we set the excitation bandpass 
to 495-550 nm to detect GFP, and to 578-650 nm to detect mCherry. There was 
minimal bleedthrough between the two channels. Images were acquired at a 
resolution of 141nm per pixel. Myosin—GFP images were obtained with a 
PerkinElmer Ultraview spinning disk confocal microscope, a 60/1.2 numerical 
aperture water-immersion objective, an argon/krypton laser and an Orca ER 4 
charge-coupled device camera (Hamamatsu). 


nature 


Image processing and analysis. The images presented were processed with 
ImageJ (http://rsb.info.nih.gov/ij/) and Photoshop CS (Adobe Systems, Inc.). 
Unless otherwise noted, Spider-GFP images represent a single confocal slice 
21m below the apical cortex, whereas Myosin-GFP, Myosin—mCherry and 
Moesin—GFP images represent maximum-intensity Z-projections 511m in 
depth. A Gaussian smoothing filter with a radius of one pixel was used to reduce 
noise in published images. 

To show apical cell shape, we manually selected Z-slices at a depth that was 
about 2 ttm below the apical surface and the apical myosin. Cell outlines at 2 um 
depth were very similar in dimensions to more apical cell outlines; however, they 
were easier to reveal and lacked membrane irregularities (namely in Fig. 3c) that 
complicated image analysis. Shifts in Z-position did not result in discontinuities 
in the data (Fig. le), and the pulsed behaviour that we describe was also observed 
when a single Z-slice was used. 

We developed methods to measure apical area, constriction rate and myosin 
intensity with MATLAB (MathWorks). Raw images were bandpass-filtered with 
effective cutoff wavelengths of 1.4 jtm (low pass) and 17.9 tm (high pass). After 
thresholding, a series of morphological operations were applied to reduce the 
width of the membranes to one pixel and to remove spurious background 
labelling. Examples of extracted cell outlines are shown in Supplementary Fig. 
1. Indexed cells were automatically tracked on the basis of distances between cell 
centroids at subsequent time points. Cell properties were measured at each time 
point. We manually removed cells with errors in the segmentation to ensure that 
all cells in the data set were correctly identified. Data for apical area and myosin 
were smoothed with a Gaussian smoothing filter (o = 15-18s, three time 
points), and constriction rates and myosin rates of change were calculated from 
the smoothed data. Unless otherwise stated, a constriction pulse was defined as 
an event in which the constriction rate exceeded one standard deviation above 
the mean (more than 10.8 1m’ min” '). To measure myosin intensity, we first 
clipped intensity values below two standard deviations above the mean to sepa- 
rate myosin structures from unselective background labelling. Myosin intensity 
was then measured from maximum-intensity Z-projections (two highest values 
averaged) as the sum intensity of all pixels in a given cell. The correlation between 
constriction rate and the myosin intensity rate of change was determined by 
calculating the correlation coefficients between these two data sets for individual 
cells using the time interval from 1.7 min to 4.8 min. To examine the time 
dependence of this correlation, the data for constriction rate and myosin rate 
of change were shifted in time relative to one another. 

Furrow depth (Fig. le) was calculated by measuring the distance between 
vitelline membrane and apical cortex in YZ cross-sections. 

Myosin-GFP spot velocity was measured using the Manual Tracking plugin 
for ImageJ. We tracked individual spots or other distinct myosin structures for 
the duration of their lifetime and averaged the three highest velocities to calculate 
maximum velocity. For wild-type movies, we calculated the velocity of myosin 
structures coalescing. We chose coalescence events that occurred early or 
occurred near the midline to minimize the effects of tissue movement during 
invagination. 

Drug injection. Embryos were dechorionated in 50% bleach for 2 min, washed 
with water, mounted on the edge of a glass slide (ventral side upward) with 
embryo glue, and desiccated for 4-8 min. A 3:1 mixture of halocarbon 700:halo- 
carbon 27 was used for injection. Embryos were injected laterally at mid-late 
cellularization (furrow canals had passed the base of the nuclei) with about 1% 
egg volume of control or drug solution. We injected 0.5mgml ' CytoD 
(Calbiochem) in 10% DMSO. 

RNAi. Primers for double-stranded (ds)RNA were designed with E-RNAi 
(http://www.dkfz.de/signaling2/e-rnai/)”. Primers included the sequence of 
the T7 promoter (5'-TAATACGACTCACTATAGGG-3’) followed by the fol- 
lowing recognition sequences: Twi01-F, 5’-GCCAAGCAAGATCACCAAAT-3’; 
Twi01-R: 5'-GACCTCGTTGCTGGGTATGT-3’; Twi02-F, 5’-GGAGCTGCA- 
GAACAATGTGA-3’; Twi02-R, 5’-TGCTGTTGCTGGTGGATTAG-3’; Sna01-F, 
5'-CGGAACCGAAACGTGACTAT-3’; Sna01-R, 5’-GCGGTAGTTTTTGGCAT- 
GAT-3'; Sna02-F, 5’-ATCATGCCAAAAACTACCGC-3’; Sna02-R, 5’-AGCGAC- 
ATCCTGGAGAAAGA-3’; Control-F, 5’-GAATGCTATGGGAGGCGATA-3’; 
Control-R, 5'-TCAGCTTAGGCTCTGGGTGT-3’. 

Primer pairs were used to amplify a PCR product from genomic DNA. PCR 
products were used directly in a transcription reaction with T7 polymerase with 
the MEGAscript transcription kit (Ambion). Annealing was performed by add- 
ing 10 mM EDTA, 0.1% SDS and 0.1 M NaCl to the reaction and incubating this 
mixture in a water bath heated to above 90 °C, which was allowed to cool for 
several hours. The dsRNA was purified by extraction with phenol/chloroform 
and resuspended in injection buffer (5mM KCl, 0.1mM sodium phosphate, 
pH7.0). We injected a 2mgml! solution of snail or twist dsRNA into the 
embryo. Identical results were obtained with either dsRNA fragment used to 
knock down twist or snail. Sna01 and Twi01 were used for all the experiments in 
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the manuscript, with the exception of Supplementary Fig. 4e, in which Twi02 was 
used. Control primers amplified a portion of an unknown ORF, CG3651, which 
has no function during ventral furrow formation. Injections were performed as 
described for the drug injections, except that egg collections were performed 
after 30 min to inject embryos at the earliest possible stage. To provide twist- 
independent snail expression we crossed Spider—GFP virgin females to P[sna,]”° 
males and collected embryos for twist dsRNA injection. 

Statistics. Statistical significance between means was determined with an 
unpaired t-test. P values were calculated with Prism 5 (Graphpad Software, Inc.). 


28. Edwards, K. A., Demsky, M., Montague, R. A., Weymouth, N. & Kiehart, D. P. 
GFP-moesin illuminates actin cytoskeleton dynamics in living tissue and 
demonstrates cell shape changes during morphogenesis in Drosophila. Dev. Biol. 
191, 103-117 (1997). 

29. Arziman, Z., Horn, T. & Boutros, M. E-RNAi: a web application to design optimized 
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Major gradients in putatively nitrifying and 
non-nitrifying Archaea in the deep North 
Atlantic 


Héléne Agogué, Maaike Brink, Julie Dinasquet & Gerhard J. Herndl 


Nature 456, 788-791 (2008) 


The yaxis in Fig. 3 of this Letter was incorrectly labelled “CO, fixation 
(m? day~')’. It should read ‘CO, fixation (umol C m~* day~')’. 
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naturejobs 


his issue introduces the 2009 class of Naturejobs Postdoc Journal keepers. 
As usual, the competition for our four spots was intense. This year’s crop 
was chosen from 60 applicants from 20 countries, working in a variety of 
fields — from marine-resource management to behavioural neuroscience. 
The winners demonstrated wit, a knack for writing, intriguing backgrounds and a 
keen insight into the challenges and dilemmas facing postdocs of all backgrounds. 

Three of the journal keepers have more than research to keep them busy, as 
parenthood has introduced the familiar challenges of a work-life balance. Journal 
keeper Julia Boughner, a British-born postdoc in evolutionary developmental 
biology at the University of Calgary in Canada, describes her predicament as “the 
race to secure a tenure-track position before family and other personal demands 
drive me out of my dream of academia”. Boughner models the mechanisms of 
human craniofacial variation in a morphometrics lab. 

Joanne Isaac, studying the impact of climate change on tropical systems and 
species at James Cook University in Queensland, Australia, is also juggling a 
postdoc, motherhood and a relationship. Isaac delved into the life history of brushtail 
possums as a graduate student. Bryan Venters, a postdoc at Pennsylvania State 
University at University Park, is a new father weighing up the merits of industry 
versus academia for his future. Venters studies gene regulatory mechanisms as a 
postdoc; he mapped genome-wide locations of yeast transcription before that. 

Sam Walcott described a different but increasingly common set of challenges: 
how to plan one’s career and training as an interdisciplinary scientist. Trained in 
theoretical and applied mathematics, Walcott is a theoretical biophysics postdoc at 
Johns Hopkins University, Baltimore, where he focuses on molecular biomechanical 
models related to muscles. He wonders if a fledgling interdisciplinary scientist, by 
focusing on just one discipline during his postdoc, adversely affects his career. 

| hope readers will look forward to tracking our journal keepers’ progress as they 
pursue their own career aims. And | offer my thanks to all those who applied. 

Gene Russo is editor of Naturejobs. 
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MOVERS 


Linda Birnbaum, director, National Institute 
of Environmental Health Sciences, Research 
Triangle Park, North Carolina 


2008-present: Senior 
toxicologist, National 
Center for Environmental 
Assessment, Research 
Triangle Park, North Carolina 
1989-2008: Director, 
Experimental Toxicology 
Division, National Health 
and Environmental Effects 
Research Laboratory, 
Research Triangle Park, 
North Carolina 


After 16 months without a director, the US National 
Institute of Environmental Health Sciences (NIEHS) in 
Research Triangle Park, North Carolina, has a new head. 
Toxicologist Linda Birnbaum, who has spent much of her 
career at the Environmental Protection Agency (EPA), took 
the helm at the institute this month. 

Birnbaum started off as a biologist at the University 
of Rochester in New York, and was soon attracted to the 
burgeoning fields of molecular biology and molecular 
genetics. Her PhD at the University of Illinois in Urbana saw 
her map the ribosomal RNA genes of Escherichia coli. After 
a postdoc at the University of Massachusetts in Amherst 
and a stint at Kirkland College in Clinton, New York, 
Birnbaum took a post at the Masonic Medical Research 
Laboratory in Utica, New York, where she studied ageing. 
Here began her career-defining work in toxicology. 

In studying how altered metabolism could affect ageing, 
Birnbaum modulated metabolism in rats using dioxins 
and polychlorinated biphenyls. She looked at how these 
chemicals break down in the human body and for the next 
10 years explored the molecular effects of dioxins, analysing 
the relative toxicity of related chemicals, and designing long- 
term bioassays to assess cancer risk. “If anybody had told 
me that 30 years later I'd still be working with that family of 
chemicals, | wouldn't have believed them,” she says. 

Taking over as head of the EPA's Environmental Toxicology 
Division in 1989, Birnbaum expanded its ranks to an all-time 
high of 90 full-time employees and dozens of students 
and postdocs. Despite budget cuts, she found ways to 
continue toxicology studies by partnering with other federal 
agencies or academia. She spearheaded some of the first 
work documenting the mechanism of action of endocrine- 
disrupting chemicals, such as brominated flame retardants. 

Kenneth Ramos, president of the Society of Toxicology, 
hopes that Birnbaum will help the NIEHS, and the maturing 
field of environmental health, better define itself. “With 
an internationally recognized toxicologist as its leader, 
the institute can now focus its efforts and have an impact 
on disease causes," he says. “Linda has the expertise and 
conviction to inspire and grow the institute on many levels. 

Birnbaum is encouraged by the incoming administration 
of Barack Obama's stated commitment to science, health 
and the environment. “I plan to create a holistic approach 
that can deal with the biggies, from complex mixtures of 
toxic chemicals to climate change,” she says. | 
Virginia Gewin 
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Research assistants join a union 


Research assistants at the Research 
Foundation of the State University 

of New York (SUNY) in Stony Brook 
have decided to unionize — the latest 
development in ongoing unionization 
battles at US universities. Nearly 

all who voted last month to join the 
Communication Workers of America 
(CWA) are working in science, says 
Matthew Engel, a Stony Brook research 
assistant who campaigned for union 
representation. Frustrated by issues 
such as fees and job insecurity, they 
are seeking benefits comparable to 
those received by teaching assistants. 

Research assistants are graduates 
temporarily doing academic research 
at a college, university or non- 
university institution. They usually 
work on projects supervised by 
full-time academics who administer 
the funds that provide their salaries. 
Teaching assistants generally receive 
fixed pay from the university. 

In 2004, the federal National 
Labor Relations Board ruled that 
research assistants are students, 
not employees, and so could not 
be represented by a union. But a 
2007 board ruled that those at 
the SUNY Research Foundation in 
Albany, Buffalo and Syracuse were 
fundamentally employees. 

Teaching assistants are already 
represented by the CWA. “We found 


out that they had negotiated some 
benefits, and we thought it was a 
good idea,” says Engel, a doctoral 
student in Stony Brook's biomedical 
engineering department. According 
to the union website, its local branch 
now represents more than 4,000 
research and teaching assistants 
throughout the SUNY system. Engel 
hopes to negotiate reduced fees and 
improved pay and health benefits. 
SUNY research assistant Luigi 
Longobardi emphasizes the 
importance of job security for non-US 
research assistants whose visa status 
depends on funding. “If we're going to 
be without funding, we should have 
a fair amount of time to find another 
adviser or alternative sources of 
funding,” he says. 

According to the Coalition of 
Graduate Employee Unions, some 
30 unions cover graduate employees 
on more than 60 campuses. 

“The Research Foundation 
is committed to following all 
laws and regulations related to 
collective bargaining,” a foundation 
spokeswoman says. The university 
has raised no objections to the vote. 

Engel says the research assistants 
are selecting a negotiating team and 
expect to sign a contract with the 
Research Foundation soon. a 
Karen Kaplan 
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A pregnant pause 


What is the science behind short-term memory loss? As the sleep-deprived 
mum of a six-month-old boy, I'd enjoy knowing the technical aspects of my 
hamstrung brain. | study craniofacial development and evolution with the 
aim of understanding the molecular mechanisms underlying coordinated 
morphological change in the teeth and jaws, mostly in primates. 


I've had frequent memory lapses since returning to the lab in December after 
half a year of leave. Finicky bench work was challenging enough when my pre- 
offspring focus was at its sharpest. | will have to compensate for my sideways 
mind by finding tactics that enable me to be productive while managing the 
baby’s demands on my time. | am my own work-life balance guinea pig. 

The next 12 months will be about finding my balance as a partner anda 
parent, regaining my stride as a scientist, and achieving my career goals. | aim 
to be a tenure-track assistant professor with my own lab and minions. (What's 
a bona fide scientist without minions?) Will | cut it? Or will | cut out? Private 
industry may offer me a more liveable (and lucrative) work-life equilibrium. | 
must thoroughly research how my skills could best be applied in an enjoyable 
non-scientific career. And | must talk to people who have successfully made 
this leap. This is my quandary and my journey as a postdoc. | 
Julia Boughner is a postdoc in evolutionary developmental biology at the University of 
Calgary, Canada. 
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Replacement 


Welcome back. 


Shelly Li 


You stand over her grave, tears splattering 
onto the 100-year-old dirt. It’s been ages 
since you saw her in the flesh, since you 
watched that crooked smile spread from 
one end of her face to the other. 

God, she was so beautiful, so flawless... 
and youre so drunk, you almost think 
about dropping to your knees, digging 
for her. But instead you clutch the 
whiskey bottle closer to your heart 
and tell her, like you always do: 
“Happy birthday’, and walk away. 

The next year you are back, 
and the next. You know what you 
want to happen, but after so many 
years, the pain never fades, and the 
memory of her has never left your 
mind for a single second. 

Then one afternoon you show 
up, high off your ass and ready to 
collapse in front of the tombstone, 
when you see someone else stand- 
ing in your spot. 

“Hey, you say, stumbling over. 
“What are you doing?” 

The man turns to you. “You 
must be the poor bastard she 
left behind.” He turns off the 
PRIVATE reading in his brain 
chip, giving you his name and 
information. 


Owen Powers, Brain-Chip 
Transferor 


You understand and don't have 
to read further. You look up at the 
man, study him for a while. He 
looks to be about 20, 25 years old, 
but in the year 2230, who can tell 
the difference? He may well be, 
like you, rounding the year 200. 

“So,” you say. “It’s been so long, I was 
afraid that you people didn't even exist” 

“Well I'm here now, aren't I?” 

You pause after that, think for a minute 
before saying: “And you're here to offer 
me your services?” You've spent too many 
years on this godforsaken planet to display 
too much emotion, only to be stabbed by 
lies, over and over again. 

But the words that come out of Powers’s 
mouth next make your heart beat to an 
irregular tune again, bring life back into 
your old, empty soul. “Yes. We will return 
your wife to you,” he says. 

A little smile touches your lips, but 
you dont let it travel far before stomping 


it back down again. “It’s been more than 
50 years.” 

“Yes, it has, and we're sorry that it’s taken 
this long to move down the wait list to you. 
Our scientists had some technical issues to 
resolve on the new models, but we think 
that you will be pleased with the progress 
we have made.” 

This time a wind of joy flies in and slams 
into your chest, lights you all the way up. 


“Okay,” you say, “then how does this 
work?” 

“We will pull your wife’s brain-chip 
records, put them inside a body and ship it 
to you. You wont get to choose the body we 
put her in, but we'll try to make an android 
that looks close enough” 

You shrug. It was never the outer appear- 
ance that was the most beautiful, but the 
angel on the inside. The only question left 
is: “Will she be the exact same person?” 

“The brain-chip recorder stops as soon 
as the heart does. Trust me, it'll be like she 
never left.” 

The doorbell rings two weeks later, and 
you stand up, shuffle to the door. 

Each step seems heavier than the one 
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before, as each heartbeat becomes harder 
and harder to slow down. 

Love, warmth, any feeling you thought 
youd lost forever when she left is back 
again, pumping through your body like 
liquid fire. 

Your finger hesitates for a second on 
the OPEN DOOR button, but you finally 
unfreeze yourself and pound on it. 

The door slides open... 

Your whole body stops, and 
your brain spins into orbit. 

It's her. It’s really her. 

You reach out, pull her into your 
arms and take a deep breath... 

But then you realize that some- 
thing’s wrong. 

She used to smell like pome- 
granate and raindrops. Now all 
that’s left is the scent of nothing- 
ness. 

Her hair, she had a cowlick in 
the back, where a patch of rebel 
hair always stuck up. This woman 
standing before you, however — 
her hair is annoyingly perfect. 

Her outer shell feels like the skin 
ofa human, but you know that she 
never will be. All she’s done is taint 
your memory of her. 

“Come in,” you say with a smile 
on your face, taking her hand and 
leading her into the house. 

You walk with a tightening 
inside you to the drawer, pull out 
your gun. Before she can even say 
a word, you pump three bullets 
into her chest. She falls to the floor, 
crackling with electricity, emitting 
ascreen of a smoke. Amid the fire 
starting in your living room, you 
squeeze a final shot, and she stops 
moving completely. 

As you stand over her body, 
tears start to slide down your face once 
again. “I'm sorry,’ you say, “but an android 
can never replace her” 

You scoff at yourself and look around 
the big, empty house. This was a stupid, 
stupid idea. Love and loneliness blinded 
you, drove you to accept this offer. 


But what's done is done. 

Now the curtain of tricks has parted, and 
you see the truth. 

The sealing hollowness of it. a 
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